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Preface 



The classical phenomenon of light scattering is one of the most studied top- 
ics in light-matter interaction and, even today, involves some controversial 
issues. A present focus of interest for many researchers is the possibility of 
obtaining information about microstructures, for example surface roughness, 
and the size, shape and optical properties of particles by means of a non- 
invasive technique such as the illumination of these objects with light. One of 
their main tasks is to extract the relevant information from a detailed study 
of the scattered radiation. This includes: measurement of the light intensity 
in different directions, analysis of its polarization, determination of its statis- 
tics, etc. Contributions to resolving this problem are important not only from 
the point of view of increasing basic knowledge but also in their applications 
to several fields of industry and technology. Consider, for example, the possi- 
bility of distinguishing between different types of atmospheric contaminants, 
biological contaminants in our blood, the detection of microdefects in the 
manufacturing of semiconductors, magnetic discs and optical components, or 
the development of biological sensors. 

During the period September 11-13, 1998, we brought together a group of 
international experts on light scattering at the Summer School of Laredo at 
the University of Cantabria. In a series of one-hour lectures, they discussed 
current aspects of light scattering from microstructures with special emphasis 
on recent applications. 

The present book condenses those lectures into five parts. After a general 
introduction to light scattering from microstructures, three important topics 
are reviewed in the theoretical section (Part I) : the basic electromagnetic the- 
ory for dealing with microstructures on substrates, the mathematical methods 
for data inversion, and an original presentation of the Mueller matrix as a 
tool for analyzing light scattering experiments. 

Part II reviews some of the most relevant numerical methods used for 
solving the scattering problem for particles on substrates. It addresses basic 
geometries for the particles (sphere and cylinder) and finishes with the more 
general problem of irregular structures. 

The polarization of scattered radiation by microstructures is also reviewed 
(Part III) in two contributions. These describe, among other things, the mul- 
tiple scattering and depolarization effects caused, respectively, by very small 
particles and by those of the order of, or larger than, the incident wavelength. 

The analysis of the statistics of scattered radiation can give insight into 
solving the “inverse problem” (Part IV). Two chapters are dedicated to pre- 
senting some background on this interesting topic. They show how informa- 
tion can be obtained about scattering particles when they are isolated or, 
alternatively, located on substrates. 




VIII 



Preface 



Finally, the last and most extensive part of the book, Part V, contains 
some of the most relevant applications of scattering from microstructures 
in both basic and applied research. These include, among others. Surface 
Enhanced Raman Scattering (SERS), monitoring and detection of surface 
contamination by small particles, optical communications, and new particle 
sizing techniques. 

An essential feature of this book is that it has been organized in a tutorial 
way, so that both researchers and students (mainly postgraduates) can famil- 
iarize themselves with these problems. To guarantee the objectivity and the 
scientific quality of this book, all of the contributions have been written by 
internationally renowned researchers in the field of light scattering. We want 
to express our gratitude to all of them for their excellent work. Of course, all 
of these efforts would have been in vain without the collaboration of many 
individuals, and public and private organizations. Hence we wish to thank: 
both the academic and administrative staff of the organization team of the 
Summer School of Laredo at the University of Cantabria, the Ministry of 
Education and Culture of Spain for its economic support under grant No. 
C097-0146 and the research project PB97-0345. Other public support came 
from the Consejeria de Educacion y Juventud (Gobierno de Cantabria) and 
the University of Cantabria through the Vice-rector of Research, the Fac- 
ulty of Sciences and the Department of Applied Physics. Our gratitude goes 
out to all of them. We are also grateful for private financial support from 
Hewlett-Packard, Lasing S.A. and Optilas S.A. Finally, many, many thanks 
to our colleagues Gorden Videen, Pedro Valle, Jose Maria Saiz and Jose Luis 
de la Pena for their invaluable help in reviewing the contributions and editing 
the manuscript. 



Santander, 
September 1999 



Fernando Moreno 
Francisco Gonzalez 
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1 Introduction to Light Scattering from 
Microstructures 



F. Gonzalez and F. Moreno 

Departamento de Fisica Aplicada, Gmpo de Optica. Universidad de Cantabria. 
39005 Santander (SPAIN) 



When electromagnetic radiation interacts with microstructures (either 
isolated or located on a surface) whose size is of the order of the incident wa- 
velength, scattered radiation can be detected in all directions. Furthermore, 
the electromagnetic characteristics of the scattered wave in a given direction 
may be changed due to the interaction of the incident radiation with the 
microstructure. This constitutes an important electromagnetic problem since 
long time ago and, in order to solve it some information has to be obtained 
about the physical scattering structure (i.e. size, shape, optical properties, 
etc). The interest in solving this problem is not only from a basic point of 
view but for the applications in many fields. For instance, microstructures 
in volume are common in areas like meteorology, astronomy, biology, radar, 
atmospheric contamination, optical oceanography, etc.; whereas microstruc- 
tures on surfaces are common for instance in the semiconductor industry, high 
reflectance surface contamination, SERS, optical particle sizing, etc. Because 
of these applications, many researchers have developed theories and numeri- 
cal methods, and have carried out experiments in order to solve those elec- 
tromagnetic scattering problems. Recent scientific meetings in close relation 
with the scattering of electromagnetic waves from microstructures confirm 
the current interest of the scientific community [1]. A number of issues have 
been devoted to the subject in some journals [2-4], and also several books of 
collected works (theoretical and applied) have been edited [5,6]. 

Obviously, a review of all the theories developed for solving the problem 
of scattering from microstructures, or even a list of the works related with 
the topic, is an ambitious undertaking. The aim of this chapter is to show 
only some basic aspects of its solution and to describe some of the methods 
most frequently used, depending on the size, shape and optical constants of 
the microstructures. We think this introduction will help the reader to under- 
stand the different contributions of the book, each one focused in a particular 
aspect. We distinguish between the isolated particle and the particle or micro- 
structure on a substrate. For the first case it is worthwhile mentioning here 
some excellent reviews [7,8] in which some of the most relevant works are 
described. 

The chapter is structured as follows: Section 1 is devoted to describe the 
basic electromagnetic tools. In Sects. 2 and 3 we survey the most commonly 
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2 F. Gonzalez and F. Moreno 

used methods in scattering from, respectively, isolated regular and irregular 
microstructures. In particular, in Sect. 3 we shall describe three methods 
that, to our opinion, are the most powerful from the calculus point of view: T- 
matrix method. Discrete Dipole Approximation method and Finite-Difference 
Time-Domain method. Finally, Sect. 4 is entirely devoted to describe the most 
important models in the analysis of the scattering from microstructures on 
surfaces. 

1 Electromagnetic Theory 

The basic tools for the solution of the scattering of electromagnetic waves 
from microstructures are the Maxwell equations. Integral theorems applied 
to these equations and appropriate boundary conditions of the problem con- 
stitute a general operational scheme. As it is well known, an electromagnetic 
wave propagating in a medium is characterised by its electric field vector E 
and its induction magnetic vector B. In the medium, the response to these 
fields is given by the electric displacement vector D and the magnetic field 
vector H. These macroscopic magnitudes satisfy Maxwell’s equations: 







(1) 


BB 




(2) 


<1 

U 

II 


(3) 


V-B = 0 


(4) 



where j and p are the electric current density and the charge density, respec- 
tively. D and H are related to E and B by means of the relations 



D = eo£rE (5) 

H=^^B (6) 

flQl-lr 



where e = epCr is the dielectric constant of the medium, eo is the dielectric 
constant of vacuum and is the relative dielectric constant, p, = poPr is the 
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magnetic permeability of the medium, /tq is the permeability of vacuum and 
fXr is the relative permeability. Furthermore, j and E are related by Ohm’s 
Law: 



j = (7) 

(7c being the electric conductivity of the medium. 

The solution of a problem in a given region of space requires the solution 
of the Maxwell equations in the medium, which is somehow limited by other 
media. Around the surface S separating regions 1 and 2 constituted by dif- 
ferent media, some boundary conditions must be satisfied by E, B, D, and 
H. Let rii 2 be the unitary surface vector, from medium 1 to medium 2 in a 
given point of surface S. The boundary conditions are: 



ni2 X (E 2 — El) = 0 (8) 

rii2 X (H 2 — Hi) = js (9) 

ni2 • (D 2 — Di) = PS (10) 

ni2-(B2-Bi) = 0 (11) 



where js and ps are the electric current density and the charge density at 
the surface, respectively. 

For the case of a homogeneous and isotropic medium, and if the electric 
current and charge densities are null at the surface, js = 0, ps = 0, the 
Helmholtz equation is derived from eqs. 1-4: 

= 0 (12) 

where k = ojy/tfi, is the spatial part of the electric field, uj is the angular 
frequency, E(r), or magnetic field, H(r), and the time dependence is assu- 
med exp (—iut). The solution of the Helmholtz equation, with the boundary 
conditions of (8)-(ll) constitute a description of the electromagnetic field in 
any region of space. 
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2 Isolated Regular Particles 

We refer to a regular, or simple, particle as one whose interfaces can be de- 
fined by having one of the components of a coordinate system set equal to 
a constant; e.g., a sphere can be defined by r = C, a cylinder by p = C, 
a plane hy z = C, etc. The simplest microstructure is a sphere. For such a 
system it is most convenient to express the wave equation in spherical coor- 
dinates and carry out a separation of variables. The incident, internal and 
scattered electromagnetic fields can be expressed in terms of vector spherical 
harmonics, and the coefficients of the harmonic expansions, which represent 
the scattering solution, depend on the composition and morphology of the 
microparticle. The first studies, carried out by Mie, Lorentz and Debye, dealt 
with the scattering of an incident plane wave by a non-magnetic, isotropic, 
homogeneous, spherical particle placed in a non-absorbent medium. This is 
often referred to as Mie theory. A detailed study of this solution can be found 
in the classic literature of the field (van de Hulst [10], Kerker [11], and Bohren 
and Huffman [12]). For nonspherical particles, rigorous analytic scattering so- 
lutions exist only for a few special cases: very small particles compared with 
the wavelength or infinitely long cylinders. More recently [13-16], exact so- 
lutions were found for spheroidal particles by means of first expressing the 
vectorial wave equation in spheroidal coordinates for the separation of varia- 
bles, and second, expanding the incident, internal and scattered field vectors 
in spheroidal wave functions. As before, the coefficients in the expansion are 
the unknowns to be determined by the boundary conditions. 

These analytic solutions may be applied to particles of all sizes, but nume- 
rical results are not always trivially calculated. Convergence and truncation 
criteria contribute to the difficulties. Algorithms and numerical methods have 
been developed to assist in the calculations [17-20]. In [20] the programs are 
based on the T-matrix method that is described below. 

Though analytic solutions are possible only for simple geometries, nume- 
rous works focus on scattering by inhomogeneous, anisotropic particles ha- 
ving irregular geometries, and also consider different profiles of the incident 
beam (Gaussian, for instance). A solution for a coated sphere (homogeneous 
core covered by a uniform layer of a different material) was first obtained 
by Aden and Kerker [21]. A reformulation of that theory was employed by 
Toon and Ackerman [22] to perform a numerical method for the calculation 
of scattering from concentrically coated spheres. The radiation of a spherical 
core-shell system was also calculated by Fuller [23], using the vector spherical 
harmonics. A complete analytical study of the multilayered spherical particle 
as an extension of the Mie solution was carried out by Bhandari [24], alt- 
hough the computational scheme of his results require the implementation of 
complex algorithms. More recently Wu and Wang [25], Kai and Massoli [26], 
and Wu et al. [27] proposed more stable and accurate recursive procedures 
for numerically computing of the scattering coefficients. 
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Another particle system of interest is the case of host particles having 
eccentric inclusions. The use of multiple- multipole expansions have allowed 
researchers to satisfy boundary conditions for these cases [28-31]. In this 
method, vector harmonics are expanded about multiple coordinate locati- 
ons corresponding to the center of each inclusion. These expansions can be 
translated to the host coordinate system and the boundary conditions can be 
satisfied on all locations simultaneously. The same procedure has been used 
to calculate the scattering from aggregates [32-35]. 

3 Isolated Irregular Particles 

In many situations where the knowledge of light scattered by microstructu- 
res has some relevance, the structures have a non-spherical shape and/or are 
nonhomogeneous. Sometimes the information provided by the solution to less 
complicated cases (like those discussed above) may approximate the system 
sufficiently that the results are meaningful, but sometimes the system is too 
irregular, and we need to resort to a specific solution. In these cases, the re- 
quired computational methods are directly based on the numerical solution 
of Maxwell’s equations, either in differential or integral form. Here we refer 
to the most powerful methods from the calculus point of view, though the 
computational requirements may be very high in some cases. In particular we 
shall describe briefly the T-matrix method and the Discrete Dipole Approxi- 
mation (DDA) based on the integral form of the Maxwell equations, and the 
Finite-Difference Time-Domain (FDTD) technique, based on the differential 
form of the Maxwell equations. 



3.1 Integral Methods 

One approach to the scattering problem utilizes an integral formulation of 
the Maxwell equations. The starting point is the differential form of Ma- 
xwell’s equations. By applying the appropriate boundary conditions one ob- 
tains more compact equations by means of the application of well known 
integral (Green) theorems. An example is the so called extinction theorem 
(ET) of the physical optics, which is derived from the Green theorem [9]. In 
this method two surface integral equations are derived. The first is used to 
calculate the sources along a surface S induced by the incident field 



E(rs) = Eo(rs) 



1 

iTrefeg 



V X V X 



/ 



E(r') 



dG{rs,r') 

dn 



G(js,r') 



5E(r')' 

dn 



dS 



(13) 



where Eo(rg) is the incident field at the point belonging to the surface 
S, E(r') and E(rg) are the total fields (incident plus scattered) at points r' 
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and Ts respectively, G(rg,r') is the Green’s function, n is the unit outward- 
pointing normal of the surface, ko is the wavenumber in vacuum. In order 
to calculate the total field at any point in space due to the the sources, the 
following equation is evaluated: 

■=<■■> = X ^ X / 

s 

(14) 



an 



9n 



dS 



3.2 T-Matrix 

The T-matrix method, also known as the extended-boundary-condition me- 
thod (EBCM) provides a general formulation of the scattering from obstacles 
of arbitrary size and shape, with size ratios from the Rayleigh region to the 
geometric optics limit. An excellent review of the current status of the T- 
matrix method has been published by Mishchenko et al. [36]. This method 
is a numerical technique based on the integral formulation of the Maxwell 
equations. The electromagnetic fields (incident, internal and scattered) are 
expanded in vector spherical harmonics. After applying the boundary condi- 
tions, a matrix is obtained, relating the known expansion coefficients of the 
incident field (a, b) to the unknown expansion coefficients of the scattered 
field (f,g): 



f 

9 




(15) 



This matrix is called the transition or T-matrix and depends only on the 
physical properties of the scattering object (shape, size parameter, and optical 
constants) as well as on its orientation with respect to the coordinate system. 
One enormous advantage of obtaining a T-matrix is that it is necessary to 
compute it only once and the results can be used in computations at any 
incident or scattering direction. Orientation averaging can also be performed. 

The T-matrix method was developed in a series of papers by Waterman 
[37,38], and has been widely used in scattering by microstructures. Barber 
and Hill [20] have published a book including the T-matrix software for sphe- 
res, cylinders and spheroidal particles. The T-matrix method has been used 
in many scattering investigations of many types of particles: non-spherical ho- 
mogeneous [16,39-42] and inhomogeneous [43], polydisperse samples of non- 
spherical homogeneous [44,45] and inhomogeneous [46] particles, bispheres 
[47] and arbitrarily shaped scatterers [48-52]. 



3.3 Discrete Dipole Approximation (DDA) Method 

The discrete-dipole approximation (DDA) for computing scattering and ab- 
sorption by particles was originally developed by Purcell and Pennypacker 
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[53] . It is a very general method for calculating the scattering and absorption 
of electromagnetic waves by particles of arbitrary shape and size comparable 
to the wavelength. In the DDA method, the scattering system is discretized 
and represented by an array of N point dipoles, with the spacing between 
dipoles small compared to the wavelength. Each dipole j is characterized by 
a polarizability tensor aj such that Pj = ajEj, where Pj is the instantaneous 
dipole moment and Ej is the total instantaneous electric field at position j 
due to the incident field on the dipole plus the effect of the other — 1 
dipoles 

E, = E‘“ (r,) + ^ A.-fcOfeEfc (16) 

i¥=k 

where the matrix includes the geometric factors relating the electric field 
at Vj due to the radiating dipole at r^. This equation may be solved using a 
variety of techniques including matrix inversion and from this knowledge the 
scattering properties of the particle may be evaluated. The total scattered 
field at the observation point robs is then 

(robs) = ^ A, (robs) (17) 

i 

where the matrix A.j (robs) includes the geometric factors relating the electric 
field at robs due to the radiating dipole at Vj. 

The DDA Method appears in the literature as the Coupled Dipole Method 
(CDM), used for instance by Singham and Bohren [54,55] and Dungey and 
Bohren [56] to calculate the scattering by a homogeneous spherical or non- 
spherical particle and by Bourrely et al. [57] to introduce a new formulation 
to discretize a homogeneous or inhomogeneous particle of arbitrary shape in 
terms of multiple partitions. 

An excellent review concerning the DDA method has been published by 
Draine and Flatau [58], that includes a comparison with other methods. An 
important conclusion reached in that work is that for targets with |m| < 2 (m 
being the complex refractive index) , scattering and absorption cross-sections 
can be evaluated with accuracies of a few percent, provided that the number 
of dipoles N satisfies the relation 

N > ^ \mf {kaef ff (18) 

Oe/ / being the effective radius of the target of volume V : 

«.// = (^r'= ( 19 ) 

The DDA method also has been employed to analyze the scattering by inter- 
stellar graphite grains [59], by two spheres in contact [60], rectangular solids 
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[61], porous dust particles [62], bianisotropic scatterers [63], by human white 
blood cells [64], etc. 

3.4 Finite-Difference Time-Domain (FDTD) Method 

This method is broadly used in computational electromagnetism [65]. It is 
similar to the DDA in that the scattering system is descritized, but in other 
respects it is unique. The FDTD technique is a direct implementation of 
the Maxwell time-dependent curl equations to solve the temporal variations 
of electromagnetic waves within a finite space that contains an object of 
arbitrary shape and properties. In practice, the space including the scatter 
object is descritized by a grid. The basic element of this descritization is the 
Yee cell [66]. In general, any function of space and time u can be evaluated 
as 



u{iAx, jAy, kAz, nAt) = ulj f, (20) 

Here Ax, Ay, Az are the lattice space increments in the coordinate directions, 
and i, j, k are integers. At is the time increment, assumed uniform over 
the observation interval, and n is an integer. The fields for the case of an 
absorptive scatterer are computed by a time-marching scheme: 

E”+^ {x, y, z) = exp {-rAt) E” {x, y, z) (21) 

-l-exp (— rZ\f/2) —V x H"'*"® (x,y,z) 

£r 

H"+5 (x, y, z) = H"-3 {x, y, z) - —V x E" (a:, y, z) (22) 

where e = Sr + isi is the complex dielectric constant, r = coSijer, and y 
the permeability of the medium. The unknown value of each component of 
the electric field E"+^ depends on the previous value of the electric field E" 
and on the previous values of the other components of the magnetic field 
H”+5 at the sides of the point (z, j, k). By alternating these computations 
and stepping through time, the fields can be propagated through the entire 
volume, and the near or far-held scattering solutions can be obtained. 

In the FDTD method the grid used must be smaller than the wavelength 
and the time steps must be chosen such that some stability criteria is ensured 
[65]. Traditionally, FDTD results have suffered due to spurious, nonphysical 
rehections occuring at the boundary of the matrix. These problems have lar- 
gely been solved with the introduction of perfectly matched boundary condi- 
tions [67-69] . The method has been used extensively to study the scattering, 
absorption and polarization features of large hexagonal ice crystals [70,71]. 
Recently, Videen et al. [72] compared the results with those given by multi- 
pole techniques when examining the scattering from aggregated spheres and 
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used the method to study the changes that occur when the internal structure 
of the system is modified. 

There are still other techniques developed to solve the problem of scatte- 
ring from microstructures, for instance, the method of moments (MOM) [73] , 
the multiple multipole (MMP) method [74], the finite element (FE) method 

[75] , etc. The method of choice depends on the desired accuracy, the com- 
puting resources, and on the intrinsic size and complexity of the scattering 
system. With the development of so many techniques, the method of choice 
is often the one that is most readily available. Recent studies have been un- 
dertaken to compare different methods. Recent work by Wriedt and Comberg 

[76] compares the DDA, FDTD and EBCM methods for the same scattering 
system, a cube. Cooper et al. [77] compared the MOM, MMP and the Finite 
Integration Technique (FIT) for the sphere canonical problem. 

4 Microstructures on Surfaces 

4.1 Introduction 

The scattered electromagnetic fields change significantly when an object is 
brought in close proximity to a planar or near-planar surface or substrate. 
The combination of a microstructure and a smooth surface constitute by itself 
a rough surface, and its study is of practical interest in many areas: surface 
defect detection in the semiconductor industry [78], surface degradation by 
particle contamination [79], biosensors [80], optical particle sizing [81], etc. 
Many theoretical models have been developed and many experimental mea- 
surements have been carried out in recent years. In this section a short review 
of recent works in the field is presented. 



4.2 Dipole Methods 

The simplest system occurs when the microstructure size is much smaller 
than the wavelength, so the microstructure can be modelled by a dipole. This 
approximation is valid when the field within the particle is nearly constant. 
This approximation holds when the scatterer is not too close to the interface. 
On decreasing the particle-substrate separation, the assumption begins to fail 
because higher order multipoles become significant due to particle-substrate 
interaction. 

Lindell et al. [82,83] used exact image theory (EIT) to derive the scattering 
of electromagnetic waves from a small object above an interface separating 
two isotropic and homogeneous media. EIT is used for deriving a generalized 
Green function to take into account the presence of a nearby interface. Other 
simpler models have been shown to provide quite accurate results [84]. 

Most simple models assume the dipole is illuminated by the a superposi- 
tion of the direct and reflected plane waves. The total scattered held from the 
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dipole is the superposition of the direct and image scattered fields. No multi- 
ple interactions occur between the dipole and the substrate. Jakeman showed 
that analytical expressions for the field scattered from small particles (sphe- 
res, discs, and needles) distributed on a tilted interface agreed to first order 
with those of Rayleigh- Rice theory [85] . Videen et al. used similar expressions 
to study the polarization state of small spheres near substrates [86] . Moreno 
et al. [87] studied multiple scattering effects between two small spherical me- 
tallic particles on a flat conducting surface. Germer [88] used a similar theory 
to study the angular dependent polarization of out-of-plane optical scattering 
from dipole- like particles above and below a substrate. An exact calculation 
of the substrate effect in the scattering is considered by Ortiz et al. [89,90] 
and Valle et al. [91]. These works use the exact solution for the emission of 
a dipole close to a plane interface and include the surface waves components 
of the scattered field that are identified as the surface plasmon polaritons in 
the case of metallic substrates. 

4.3 Mie Methods 

The double-interaction model of Nahm and Wolfe [92] has been used to cal- 
culate the scattering by a sphere over a perfectly conducting mirror. In this 
model, the sphere is illuminated by the beam both directly and after a spe- 
cular reflection from the surface. This secondary reflected beam is partially 
obscured by the particle. The interaction between the sphere and mirror is 
assumed to be zero, so the sphere scatters light as if it were isolated. Each 
of these two beams generates two contributions to the total scattered field: 
one directly and another after being reflected by the mirror surface. Four 
components make up the total scattered field. Similar models include Fresnel 
reflection coefficients to handle nonperfectly conducting mirrors [93,94]. 

In a series of papers, Spyak and Wolfe [95-97] compare calculated light 
scattered by spheres on a mirror with experimental data. They assume that 
the particles scatter radiation according to Mie theory, and do so indepen- 
dently. The lack of agreement between theory and experiment for the backs- 
catter may be due to the radiation reflected off the mirror before illuminating 
the sphere not considered in the model. 

Bobbert and Vlieger [98,99] reduce the problem of light scattering by a 
sphere on a substrate to the problem of scattering by a sphere in a homoge- 
neous medium and that of the reflection of spherical waves by the substrate. 
The first problem is solved using the Mie theory, for the second one they use 
an extension of Weyl’s method for the calculation of the reflection of dipole 
radiation by a flat surface. They introduce a matrix characterizing the reflec- 
tion of spherical waves by the substrate. The matrix elements are integrals 
over a complex angle that can only be analytically evaluated in the case of a 
perfectly conducting substrate and in the case of very small particles. 

Videen expanded the interaction field about the image location to solve 
the scatter from a sphere in front of [100] and behind [101] a smooth arbitrary 
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substrate and later provided an exact theory for an arbitrary particle system 
in front of [102] and behind [103] a perfectly conducting substrate. Similar 
derivations for spheres resting on substrates have been provided by Johnson 
[104-106] and Fucile et al. [107]. In this methodology, the boundary conditions 
at the particle and at the surface are satisfied simultaneously by projecting 
the fields in the half-space region not including the particle onto the half- 
space region including the particle. For nonperfectly conducting substrates 
some numerical method or simplifying assumption must be used. Theoretical 
results for a cylinder above a substrate [108] calculated using this method 
were compared with experimental results [109]. Videen was able to show that 
the polarization state of highly irregular dust particles on a substrate was 
primarily dependent on the size of the particle and could be approximated 
by a single dipole placed at the center of the particle above the substrate 
[ 110 ]. 

Moreno et al. [Ill] and Pena et al. [81] developed a modified double inter- 
action model for the light scattered from spheres and cylinders on a substrate. 
The Fresnel coefficients for the reflection at the surface are considered as well 
as a geometrical shadowing factor to take into account the shadowing effect 
in the incoming and outgoing beams. 

Borghi et al. [112] presented a method for treating the two-dimensional 
scattering of a plane wave by an arbitrary configuration of perfectly con- 
ducting circular cylinders in front of a plane surface with general reflection 
properties. This is based on the plane-wave spectrum of cylindrical func- 
tions involved in the decomposition of the field scattered by a cylinder. The 
substrate can be dielectric, metallic, anisotropic or lossy media, as well as 
multilayered. 

4.4 Integral Methods 

A number of theories have been formulated in terms of exact integral equa- 
tions for the electromagnetic fields that are solved by standard numerical 
methods. It may be stressed that the formulation is exact in the sense that 
no additional physical assumptions or approximations are needed; i.e., all 
limitations come from the numerical procedure. Integral equations are deri- 
ved from integral theorems that combine differential Maxwell equations and 
appropriate boundary conditions. One of the methods most widely used is 
the Extinction Theorem (ET) of Physical Optics [9] which produces a sur- 
face integral equation that relates the incident field with the sources on the 
surface , and these with the scattered fields. Though the ET method was 
initially used to calculate the scattering by random rough surfaces, Valle et 
al. extended the method to calculate the far-held [113] and near-held [114] 
scattering by small metallic particles on hat conducting substrates. Due to 
the exact character of the formulation, multiple interaction between particle 
and substrate is taken into account. Saiz et al. [115] and Valle et al. [116] 
have studied the ehect of particle size, separation and the ehect of the optical 
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constants on the light scattering. The method has also been used to study 
surface plasmon polariton generation with small particles on real metallic 
substrates [117]. Madrazo and Nieto- Vesperinas have established the ET me- 
thod for multiply connected domains. This improvement allows the numerical 
simulation of the scattering from systems composed of surfaces belonging to 
separated bodies of arbitrary shape [118-120]. A different integral method 
is that proposed by Greffet et al. [121] In this case they obtain a volume 
integral extended to the particle volume, including the substrate, by means 
of adequate Green functions. They have calculated the far-held and near- 
held scattering produced by 2-D particles deposited on a dielectric planar 
waveguide, with special attention to particle interaction [122], and the near 
held corresponding to a dielectric rod below a metallic surface under surface 
plasmon generation conditions. 

4.5 Other Methods 

Many other approaches to the problem of scattering from microstructures on 
surfaces may be found in the literature. Wriedt and Doicu [123] solved the 
scattering from an axisymmetric particle on or near a surface with a forma- 
lism based on the EBGM and the integral representation of spherical vector 
wave function over plane waves. Special attention was put in the interaction 
between a particle and its image. Taubenblatt [124] used a modihed version of 
the coupled dipole method to calculate the far held scattering intensity from 
a dielectric cylinder on a surface, when illuminated with a plane wave with 
held vector along the cylinder axis. Schmehl et al. [125] used the same me- 
thod to analyze the scattering by features as contaminants on surfaces. These 
works show important ehorts in the improvement of numerical techniques in 
order to accelerate the computation or obtain good convergence rates. Wojcik 
et al. [126] showed numerical solutions of Maxwell’s equations for problems 
involving scattering from submicron objects on silicon wafers by considering 
time-domain hnite elements. Kolbehdary et al. [127] analyzes the problem of 
scattering of electromagnetic waves from a dielectric cylinder partially em- 
bedded in a perfectly conducting ground plane. A held equivalence theorem 
is employed to derive an integral equation for the equivalent magnetic current 
on the dielectric interface, which is later solved using the Galerkin’s method. 
The equivalent magnetic currents are then used to determine the scattered 
held everywhere outside and inside the dielectric cylinder. 

Finally, a very simple approach is the ray-tracing solution or, in other 
words, the application of the geometrical optics approximation. A plane wave 
incident on a metallic object is, from a geometrical point of view, a beam 
of parallel rays of uniform density that is rehected by the sphere-substrate 
system. The scattered held is obtained as the coherent sum of the group 
of rays emerging from the surface with a common angle. This method has 
been showed to produce surprisingly good hts to the experiment when the 
observation is far from the specular direction (where the dihraction ehects 
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are too strong), even for particles with size of the order of the wavelength. 
For instance Saiz et al. [128] analyzed by this means the backscattering from 
particles on a substrate or the effect of the different particle density on the 
scattering patterns [129]. 
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Abstract. The image principle in electromagnetics replaces a physical structnre 
by an eqnivalent sonrce, the image of the primary sonrce. The number of structu- 
res with analytic image expressions is limited but increasing. In finding new image 
expressions, the Heaviside operational calculus can be applied by first transforming 
the three-dimensional problem to one concerning transmission lines. The image pro- 
blem can be reduced to the compact form in terms of a pseudo-differential operator 
applied to the original source function. To interprete the result in terms of com- 
putable functions, approximations can be done at this stage. As a simple example 
of the theory the interface of two isotropic media with smooth or slightly rough 
interface is considered. A table of operational formulas is given as an appendix. 



1 The Image Principle 

The image principle gives a simple method to treat electromagnetic problems 
involving interfaces of material media [1,2] by replacing the structure behind 
the interface by an equivalent image source in a homogeneous medium. Most 
probably the first definition of the image concept in electricity or magnetism 
was given by William Thomson (later Lord Kelvin) in 1849 [3]: 

The term Electrical Images, which will be applied to the imaginary 
electrical points or groups of electrical points, is suggested by the received 
language of Optics; and the close analogy of optical images will, it is 
hoped, be considered as a sufficient justification for the introduction of 
a new and extremely convenient mode of expression into the Theory of 
Electricity. 

Formulating an electromagnetic problem involving a physical structure 
in terms of an image source can be compared to working with the Green 
function associated with the boundary-value problem. The image principle 
may well be the simpler choice because in many structures it can be expressed 
in closed form while the Green function cannot. Actually, the Green function 
can always be written in the form of an integral of the image source. Because 
the image source is a quasi-physical concept, physical intuition can help in the 
computation of fields. In fact, fields arise from image sources in the same way 
as they do from physical sources. Also, image sources can be approximated 
like the physical sources to simplify the computation, even if the image source 
might exist in complex space. Further, image sources can take the role of 
unknown in integral equations. 

F. Moreno and F. Gonzalez (Eds.): Lectures 1998, LNP 534, pp. 23-39, 2000. 
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Simple Example 

To keep equations simple, let us consider a scattering example from elec- 
trostatics. Assume a static charge distribution Po(r) and a dielectric scat- 
terer of permittivity e(r) = £oer(r) in air above a layered planar structure 
z < 0. The potential scattered by the obstacle can be computed after sol- 
ving the dielectric polarization distribution within the obstacle. This can be 
done numerically through an integral equation for the equivalent polarization 
charge density Pp(r), which can be defined by writing the Gaussian law in 
inhomogeneous dielectric medium as 

V-D(r) = V-[e(r)E(r)] = 0 „(r). (1) 

If the obstacle is replaced by the equivalent polarization charge in air, we can 
write 



V • [eoE(r)] = £»o(r) -h £ip(r). (2) 

From comparison, the equivalent charge becomes 

Pp(r) = -£oV • [(e^(r) - l)E(r)], (3) 

which is nonzero only inside the scatterer where er(r) yf 1. 

Now let us consider two possible ways of forming the integral equation. 
First, assuming that the Green function G(r|r'), i.e., the potential at r due 
to a normalized point source at r' in the presence of the planar structure is 
known, the potential from any charge distribution g{r') can be expressed as 
the integral 

0(r) = /G(r|r')— dE'. (4) 

J 

When computing values for the potential inside the source, the singularity 
of the Green function in the integral must be taken into account in a proper 
way. 

The potential inside the obstacle is due to the original source (incident 
potential 4>i) and the equivalent polarization source (scattered potential /pg), 
both in the presence of the structure: 

<(’(r) = </'*(!') +<('s(r), (5) 

(pi{v)= [ , (ps{r)= [ G{r\r')^^^dV. (6) 

J io J 

The incident potential can be considered as a known function. The scattered 
potential involves an unknown source Qp, which depends on the unknown total 
potential 4>{v). This gives rise to an integral equation for the total potential: 

<t>{v) = + J G(r|r')V' • [(e.(r') - l)V'P{r')]dV' , 



(7) 
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which can be solved numerically if only the Green function G(r|r') happens to 
be known. Finding the Green function G(r|r') may well be the most difficult 
task of the problem and normally leads to a numerical analysis of another 
integral equation. 



€-o€r 





Fig. 1. Original charge Qo and a scatterer above a planar structure. The integral 
equation for the potential in the equivalent polarization charge Qp is obtained with 
the aid of images of the original charge and the unknown polarization charge. The 
figure is drawn in simplified form, because, in general, the image of a point source 
is not a simple point source. 



In contrast, in the image formulation, only the free-space Green function 
Go(r — r') = l/47r|r — r'| is needed. However, now the image for the general 
point charge above the planar structure must be known. The integral equation 
for the total potential inside the scatterer can be formed by replacing the 
planar structure through the images of the original charge distribution and 
the unknown polarization charge distribution. Fig. 1. Similar arguments apply 
to problems involving time-harmonic sources with vector sources and dyadic 
Green functions. It is thus of basic interest to find image expressions for the 
point source with respect to different structures. 



2 Heaviside Calculus 

Oliver Heaviside (1850-1925) constructed the modern electromagnetic theory 
in compact form [4] by introducing the vector notation simultaneously with 
J.W. Gibbs. In his analysis of electromagnetic equations he also used a form 
of operational calculus, which eventually led to the Laplace transform tech- 
nique and to other operator methods [5,6]. In his calculus, Heaviside treated 
differential operators very much as ordinary numbers, which allowed him to 
apply simple rules of algebra to solving differential equation problems. In the 
following it will be demonstrated that the Heaviside operational calculus can 
be effectively applied to finding image solutions for electromagnetic problems 
by starting from transmission-line concepts. 
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The relation between the Laplace transform technique and operational 
calculus can be applied to define some operational formulas readily applicable 
to the image analysis. The definition of the Laplace transform f{p) — >■ F{q) 

OO 

F{q) = j f{p) exp{-pq)dp (8) 

0 

leads through substitution q = dz = djdz to a, (pseudo-differential) operator 
F{dz), which operating on the delta function 6{z — Zo) gives us the rule 

OO 

F{dz)S{z- Zo) = j f{p)exp{-pdz)6{z - Zo)dp = f{z - Zo)0{z - Zo)- (9) 
0 

Here we have applied the Taylor expansion of an exponential function and 
the shifting rule, 



oo / \ 

exp(-pq) = V — exp{-pdz)f{z) = f{z - p). (10) 

n\ 

n—O 

9{z) denotes the Heaviside unit step function 

6»(z) = 0, z<0, 0{z) = l, z>0. (11) 

Equation (9) forms the basis of the Heaviside operator method applied 
here. Typically, the image of a point source is obtained in the form of an ope- 
rator F{dz) operating on the delta function. The task is to find the function 
f{z), which defines the image in terms of a computable function. Apply- 
ing existing tables of Laplace transforms, we can write a table of pseudo- 
differential operators operating on the delta function. A short one is given as 
an Appendix. 

3 Transmission-Line Theory 

Transmission-Line Equations 

To solve the original three-dimensional problem, it is often necessary to re- 
duce the dimension from three to one, for example, by making a Fourier 
transformation in two dimensions. After solving the problem in one dimen- 
sion, the solution can be transformed back to three dimensions. The Maxwell 
equations then become transmission-line equations and the electromagnetic 
field vectors can be replaced by voltage U{z) and current I{z) functions: 



dzU{z) = -jpZI{z) + u{z), 



( 12 ) 
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d:,I{z) = -j(3YU{z) + i{z). (13) 

Here, u(z) and i(z) denote the distributed voltage and current source 
functions. /3 is the (complex) propagation constant and Z = \jY the line 
impedance and they are assumed constant, corresponding to a homogeneous 
3D space. The second-order transmission-line equations are obtained after 
elimination as 



{d\ Y (5'^)U {z) = d^u{z) -j(3Zi{z), (14) 

(af + f3^)I{z) = dj{z) - jf3Yi(z). (15) 

In some problems transmission-line equations more general than (12), (13) 
are needed with different line parameters /3, Z for waves traveling to opposite 
directions along the line [8]. 

A combined voltage-current point source at z = Zo on the transmission 

line 



u\z) = Uo5{z - Zo), i\z) = Io5{z - Zo), (16) 

produces incident voltage and current functions 

U\z) = ^[sgn( 2 ; - Zo)Uo + ZIo] exp{-j(3\z - Zo\), (17) 

I\z) = ^[sgn(z - Zo)Io + YUo] exp{-jP\z - Zo\). (18) 

Inserting these expressions in (14) and (15), their validity can be checked. 



Reflection from Load Impedance 

Let us consider a half-infinite homogeneous transmission line z < 0 termina- 
ted by a load impedance Zl at z = 0, Fig. 2. Assuming a point source at 
z = Zo < 0, which creates the incident wave (17), (18), the reflected voltage 
and current waves can be expressed as 

U''{z) = RU\tS)exp{j(iz) = ^{Uo + ZIo) exp(j/3zo) exp(j/3z), (19) 

r{z) = -YU'"{z) = ~^{Io + YUo) exp(j/3zo) exp(j/3z). (20) 

The load impedance Z^ may represent a physical structure (an inhomo- 
geneous transmission line, for example) existing in the region z > 0. In the 
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general case, the reflection coefficient is a function of the propagation con- 
stant, R = i?(/3). In the expression of the reflected voltage it can be replaced 
by an operator R{—jdz) operating on an exponential function 

U^{z) = i?(/3)C/*(0)exp(j/3z) = C/*(0)i?(-j9,) exp(j/3z) 

= ^{Uo + ZIo)e^];>{jl3{z + Zo)) = R{-jdz)U\-z). (21) 

If R{(3) is a polynomial function, R{—jdz) is a differential operator, other- 
wise it is a pseudo-differential operator. The form of the reflection-coefficient 
function is obtained from the boundary condition as 

w{o) + u^{o) = ZL[r{o) + r{o)] ^ ( 22 ) 



W{z) — - U^z) 




Fig. 2. Reflection image f of a current source i at < 0 lies on the extension 
of the transmission line at —Zo- It gives the voltage U'^(z) reflected from the load 
impedance Zl- 



Image Sources 

Let us now replace the load impedance by finding the image sources in the 
extension 2 ; > 0 of the homogeneous transmission line which give the same 
reflected wave (21) in the region 2 ; < 0, Fig. 2. The source of the reflected 
voltage C/'’(z) can be obtained in the same way as the source of the incident 
wave in (14), by applying the operator + /3^ on (21) and identifying the 
right-hand source terms: 

(af + f3^)U^{z) = R{-jdz){dl + p^)u\-z) 

= R{-jdz)[-dzu\-z) - jl3Zi\-z)]. 



(23) 
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Denoting the reflection images by u^{z) and i’’(z) and substituting the 
original sources, we have 



- jj3Ze{z) = R{-jd:,)[-d^u\-z) - jj3Zi\-z)], (24) 

from which we can identify the image sources as 

vJ'iz) = -R{-jd^)u\-z), i^{z) = R{-jd^y{-z). (25) 

For the point sources (16) the images are 

vJ'{z) = -R{-jdz)UoS{z + Zo), f{z) = R{-jd^)IoS{z + Zo). (26) 

The expressions (26) form the basis of the image method discussed here. 
In the simplest case when the reflection coefficient R does not depend on /3, 
the reflection image consists of voltage and current generators at the image 
point z = —Zo with the respective amplitudes —RUo, Rio- 

4 Time-Harmonic Planar Problems 

Let us consider problems involving a planar stratified structure, with mate- 
rial parameters depending on one coordinate z only. In this case, the one- 
dimensional image theory can be applied by making the Fourier transform in 
the xy plane, whence the problem becomes analogous to that of a transmis- 
sion line [2]. 



Transmission-Line Analogy 

The Fourier-transformed Maxwell equations are 

Uz X 9zE — jK X E = — jw/iH — M, (27) 

Uz X 9zH — jK X H = jweE -I- J, (28) 

where the electric and magnetic fields E, H and the electric and magnetic 
sources J, M are functions of z and the two-dimensional Fourier variable K. 
Equations (27), (28) can be represented as a coupled pair of transmission 
lines. The held components transverse to z give rise to two sets of voltage 
and current quantities 

Uo = • E, h = y^Uz X K • H, (29) 



Ufji — ^2 ^ ^ 



y = -—K.n, 



(30) 
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and the transverse fields can be expressed as 

E* = KUe + u, X KU^, (31) 

-U^ X H = Kle + X Kim- (32) 

The two transmission lines are uncoupled in homogeneous space and obey 
the equations 

{dl + P‘^)Ue,m{z) = d^Ue,m{z) - j (3Ze^rnie,ni{z) , (33) 

{dl + P'^)Ie,m{z) = dJe,m{z) - j f3Ye^mUe,7n{z) , (34) 

where the source functions are 

■Ue(z) = X K • M(z) + — • J(z), ie{z) = --^K- 3{z), (35) 

(jje 

imiz) = X K • J(z) - — • M(2), Ujn{z) = • M(z). 

ujfx 

(36) 

The propagation factor is the same for both lines but the impedance is 
different: 

l3 = a/Zc^ — K“^, k = Lo^/fle, (37) 



^e = 



toe 



P 



LOfJ, k 




(38) 



When there is an interface of two media at z = 0, coupling may arise bet- 
ween the two lines, which can be represented by a coupling two-port network. 
This means for example that, a source in the e line may create image sources 
in both e and m lines. However, for an interface between two isotropic media, 
there is no such coupling and the lines can be understood as terminated by 
individual impedances Zj^e and Z^m- 



Vertical Magnetic Dipole 

As a simple example, let us consider a vertical magnetic current source (di- 
pole) in the air (eo,/To) region z < 0 , above a dielectric half space (£^£0,^0) 
in z > 0. Since im is the only source, only the m line exists and the subscript 
m will be omitted. The correponding reflection coefficient at the interface is 



R{Po) 



Zl-Z 
Zl + Z 



P- Po 

P + Po 



(39) 
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The relation between the two propagation factors 13, (3 o is 

(3 = = \/fc2 - + /c2 _ ^2 = y ^/32 _ 52 ^ (40) 

with 

B = - fc2 = koVT^r- (41) 

For perfectly electrically conducting (PEC) or perfectly magnetically con- 
ducting (PMC) surfaces corresponding respectively to = 00 and = 0, we 
have R = —1 and i? = -1-1 corresponding to short and open circuit at z = 0. 




Fig. 3. The dielectric half space z < 0 can be replaced by the reflection image 
of the original magnetic dipole M. The image consists of a magnetic dipole and a 
magnetic line current obeying Bessel function law. 

The vertical magnetic dipole of moment MgL corresponds to a current 
source in the transmission line: 

M(r) = u^MoW(r - u^Zo) ^ i{z) = IoS{z - Zo), L = ^—MoL. 

(42) 

For the reflection image we now have from (26) the simple formal solution 
i^{z) = R{-jd,)IoS{z + Zo), (43) 



or, more explicitly, 



1^{Z) = -lo 



^di + B^ + d, 



S{z + Zo). 



(44) 
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Invoking (73) for n = 2 from the Appendix, the solution can be expressed 
as 

t^iz) = -Io-J2{Bz)0{z). (45) 

z 

Because the image does not depend on the Fourier variable K, the corre- 
sponding vertical magnetic current source has S{p) = S(x)S(y) dependence, 

M'’(r) = -u^cjpoi'’(z)S(p) = -u^MoL-J2(Bz)0(z + Zo)S(p). (46) 

z 

Thus, the image is a magnetic line source extending from z = —Zo > 0 to 
oo obeying the Bessel function J 2 {Bz), Fig. 3. The result (46) was previously 
obtained in [9,2] through a much more complicated analysis. 

It must be noted that the image expression works well for real €r < t, 
6r being the ratio of the two permittivities. If > 1, the coefficient B 
is imaginary and the Bessel function J2{Bz) diverges for real z. The same 
happens for complex e^.. By assuming the image along a line in complex space 
as explained in [9,2], a useful converging image source is, however, obtained. 
A source in complex space can be applied numerically to compute fields in 
the physical space with the same ease as a source in real space. Because the 
Green function decays exponentially when the source lies in complex space, 
some computation time can be saved. 

5 Slightly Rough Interface 

As a second example, an extension of the previous one, let us consider the 
problem of two dielectric half spaces with a slightly rough planar interface, 
whose average coincides with the plane z = 0. The interface is defined by the 
height function h{p), which is assumed to be a random variable obeying the 
Gaussian probability law, 

P{h) = exp(-/iV2CT^). (47) 

(TVzTT 

The parameter cr is a measure of the rms surface height variation. 

Reflection-CoefRcient Method 

Applying the well-known expression for the reflection coefficient correspon- 
ding to the coherent part of the reflected plane wave from a slightly rough 
surface, we can write [10,11] 

R = Rs exp{—2k^(j^ cos^ Oi), (48) 

where 9i is the angle of incidence. Rg denotes the reflection coefficient cor- 
responding to a smooth surface (cr = 0), also a function of 9i. For grazing 
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incidence 9i — >■ tt/ 2, the reflection coefficient approaches that of the smooth 
surface. 

Considering fields arising from a source above the interface, Fourier trans- 
formation again reduces the problem to one expanded in terms of plane waves. 
For each plane wave the reflection coefficient (48) can be applied in the form 

R{Po) = Rs(Po) exp{-2{kl - iC^)cr^) = Rs{f3o) exp{-2f3‘^a‘^). (49) 

The relation between the two-dimensional vector Fourier variable K and 
the angle of incidence defined by the unit vector u is 

K = ufcsin6 »„ /3„ = a/p - K • K. (50) 

After the Fourier transformation, the fields depend on one coordinate z only 
and the previous transmission-line analogy can be applied with [3o as the 
propagation constant. 



Vertical Magnetic Dipole 

Considering the vertical magnetic dipole source, the previous theory can now 
be applied with the reflection coefficient (48) replacing (39): 

i?(/3/ = eM-WW) = exp(-2/3>2). (51) 

Thus, the only difference in the resulting image is the extended reflection 
operator 



M/r) = n,MoLR{-jd,)5{z + Zo)5{p) 

= VL^MoLRs{-jdz) exp(2cr^a^)i5(z -f Zo)5{p). (52) 

and the problem is to interpret the expression involving a product of two 
operators 



R{-jdz)5{z + Zo) = exp{2a‘^d‘^)Rs{-jdz)S{z + Zo). (53) 

Extending the expression (9) to the product of two operators ^1(92), 
^2(^2) satisfying 



F,(d^)S(z) = fi(z)0(z), i = 1,2, 



leads to the following convolution rule: 



(54) 



00 

Fi(dz)F2(dz)S(z) = Fi(d,z)lf2(z)9(z)] = J fi{p) exp{-pd^)[f2{z)9{z)]dp 



0 
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= J h{p)[f 2 {z - p)9{z - h)]dp = [fi{z)6{z)] * [f2{z)9{z)]. (55) 

0 

Thus, the image of a rough dielectric interface can be expressed as the 
convolution of the images corresponding to the smooth dielectric interface 
and the rough PMC surface. The former was considered above. 

Let us consider the image due to a rough perfect magnetic conductor 
(PMC) surface which is obtained as the limit e^. — >■ 0 of the dielectric interface. 
Because we now have Rs = 1, (53) reduces to 

R{-jdz)5{z + Zo) = exp(2(T^9^)(5(2 + Zo)- (56) 

The corresponding magnetic current image can be written as 

M’’(r) = n^MoL ey:p>{2a‘^ dl)5{z + Zo)5{p). (57) 

To find the image function we invoke a result from [5] which cannot be 
found from Laplace transform tables, 



exp{ad^)S{z) = exp(— z^/4a). 

V27ra 



(58) 



This leads to 

M’'(r) = Uz exp{-{z + Zof lSu'^)5{p). (59) 

VSTTCr 

This represents a magnetic line current obeying the Gaussian distribution, 
with its maximum at the image point z = —Zo, Fig. 4. 




Fig. 4. Image of a vertical magnetic dipole above a slightly rough PMC plane is 
a magnetic line current broadened to a Gaussian pulse when the roughness of the 
surface obeys Gaussian statistics. 
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The image function corresponding to the rough dielectric interface can 
now be interpreted from (53) as the convolution 

0 

exp{2a'^dl)Rs{jd^)5{z) = - ^ [ exp(-(z' - zf / 8 a‘^)^J 2 {Bz')dz' , 

\ 2i'tt ( y J '2' 

— oo 

(60) 

which gives a blurred specular image [12]. It does not seem possible to find 
a closed-form expression for this image function. Numerical values can, ho- 
wever, be easily computed from the convolution integral and stored in the 
computer memory for quick reuse in an integral equation solution process, 
for example. The image function f(z) has properties similar to those of the 
specular image function. 

The change in the amplitude of the coherent reflected wave due to the 
roughness of the surface can now be interpreted in terms of the image theory. 
When a point image becomes a line image of Gaussian distribution, the en- 
suing phase difference makes partial cancelation in the radiated field. This 
effect is most notable in the normal direction and disappears in the direction 
tangential to the interface. 

The image principle described can be applied to a wide variety of pro- 
blems involving a rough surface. For example, scattering from an object above 
a rough surface can be handled by replacing the object through an equiva- 
lent polarization source. Because of the approximative nature of the starting 
reflection coefficient (48), the image method has some limitations. Theore- 
tically the image line extends from z = — ooto 2 = -|-oo and thus enters 
the wrong half space z < 0. However, because the roughness was assumed 
slight, the parameter cr is small, which means that the Gaussian distribution 
is very concentrated. Also, the plane-wave reflection approximation requires 
that the source height Zo be large enough so that surface waves are not ex- 
cited. Because of these limitations, the image has a negligible value in the 
region z < 0. 

6 Other Structures with Image Solutions 

A number of other structures have been studied in the past, through various 
methods, for finding analytic expressions of the image function corresponding 
to the structure in question. It may suffice to mention some solved cases 
and refer to the literature for details. For planar structures, the microstrip 
geometry (dielectric slab above PEG plane) [13], the dielectric slab in air 
[14] and the uniaxial anisotropic hald space [15] have been treated for time- 
harmonic sources while the general anisotropic half space has been solved 
for electrostatic sources [16]. A solution with more general time dependence 
involving a noncausal current was found for the impulse source in [17]. Also, 
a slightly chiral half space has found an analytic solution [18]. 
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The classical Kelvin’s electrostatic image theory for a PEC sphere has 
been extended to the dielectric sphere [19], also solvable through the genera- 
lized asymmetric transmission-line concept [8]. The theory was extended to 
the layered sphere [20] and low-frequency time-harmonic sources [21]. Other 
solved non-planar geometries include the conducting half plane [22] and the 
conducting wedge [23] for time-harmonic sources and the dielectric wedge 
[24] for electrostatic sources as well as the conducting spheroid [25]. 

As mentioned in the introductory part, image expressions like those above 
can be readily applied to scattering problems, some examples of which are 
given in references [26-28]. 

Appendix: Table of Operations 

Applying a table of Laplace transforms in, e.g., from [29], we can write a 
corresponding table for some pseudo-differential operators F(dz) operating 
on the delta function^: 



F{d,)S{z) = f{z)9{z). 




^n— 1 


(61) 




(62) 


d~^/^S{z) = 2^/^9{z), 


(63) 


9z"S{z)= j^^^^9{z), iy>0, 


(64) 


Q _^^'^W=exp( Bz)9{z), 


(65) 


a _L R S{z) = S{z) + {Bi S 2 )exp( B2z)9{z), 

Oz + ±>2 


(66) 


(a Sz)6»(z), 


(67) 



n = 1, 2, 3, • • • is an integer while i' may be any nnmber. 
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1 r. X exp(-Biz) - exp(-B 22 :) 






(a/^z + Si - + B 2 )( 5 (z) = ^=(exp(-Biz) - exp(-B 2 z)) 6 '( 2 ), 



/92 + 52 



(5(0) = Jq{Bz)9{z), 



{^dl + B^-d,y 

Jd 1 + 52 



5 ( 0 ) = B'' J^{Bz)9{z), 



/f )2 I D 2 <9 \ 1 77 

/y ^ ^ a ■'<-) = (x/aFTB^ - a.)“«(2) = -J„(B2)»(2), 



+ (74) 






a - 1 \ " J 2 n{z) 



— 1 \a+ly z 

n=l ^ 



1 f z — a 



exp ( — \J&1 + B'^a) 



, ^( 2 ) = ^ jyBVz^-a‘^)9{z-a), 

fdl + B-^{^dl + B^ + d,Y B''\z + a) 






32_b2 I 0^+ /02_S2 
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Abstract. The study of ill-posed linear inverse problems, characterised by Fred- 
holm equations of the first kind, has important applications in many areas of science 
and technology. Problems of this type introduce some loss of information between 
what we may call a generalised “object” and its “image” under the linear mapping. 
This loss of information often makes the attempted inversion of image to object 
very difficult. In practice the image is usually a discrete set of observations but the 
mapping may be retained essentially as a finite-rank integral operator using a fine 
discretisation throughout in the object space. This gives a very useful “automa- 
tic” interpolation of the recovered object at a sampling rate much higher than the 
conditioning of the problem would otherwise allow. Sub-micron particle sizing by 
Photon Correlation Spectroscopy is a problem where this type of inversion difficulty 
arises. Work is presented here which attempts to add in some of the lost information 
by making use of the a-priori constraints of positivity and known moments. This 
is achieved by the dual method of quadratic programming using some spectacular 
new mathematical results in this field. 



1 Overview 

The light-scattering method of photon correlation spectroscopy (PGS) [1] has 
now been used for over twenty-five years for determining the size distribu- 
tion of dispersed suspensions of sub-micron particles. These particles may be 
macromolecules of varying types including, for example, proteins, enzymes, 
viruses, polymers, liposomes and other laminar vesicles, micelles and droplets 
in microemulsions. 

We will first give a short discussion of the origins of the technique and 
the basic physics, consider briefly experimental methods and their evolution 
and pose the data inversion problem to be addressed. These introductory 
sections of the paper which introduce and set up the scene will lean heavily 
on a recent review article written by one of the authors [2] for specialists 
in pharmacology, who use the technique for many important applications 
including the design of new drug delivery systems. 

We also base the method to be described in this paper on an observation 
[3] that the data inversion can be improved by a different treatment of back- 
ground subtraction and describe this in a section taken substantially from an 
article in a NATO ASI proceedings [4]. Other parts of the paper also sum- 
marise previous work of the authors but the application to the PGS problem 
is published here for the first time. 
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© Springer-Verlag Berlin Fleidelberg 2000 
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From the point of view of instrumentation and technique, given careful 
sample preparation, in particular, scrupulous cleanliness to avoid the intro- 
duction of dust or other contaminant particles, sufficient dilution to avoid 
particle-particle interactions and complete dispersion to break up aggrega- 
tions, the PCS measurement of the mean value and variance (the polydi- 
spersity) of the particle-size distribution is easy, fast, accurate, absolute and 
non-perturbative. Pre-aligned or carefully aligned optics and standard detec- 
tion electronics, used within their known limits, rarely give trouble and, as 
provided by a number of commercial manufacturers, can be found in routine 
use in many laboratories world-wide. 

The main purpose of this paper, however, will be to deal with the difficult 
problem of obtaining further information about the particle-size distribution 
by inversion of the correlation function. The difficulty is that, in contrast to 
the extraction of the mean and variance, this problem is one that falls into 
the class of linear inverse problems involving compact operators that suffer 
from ill-posedness and is, in fact, one of the more extreme problems of this 
type. 

Such inverse problems have been the target of many investigations in 
recent years in many different areas, including other electromagnetic scat- 
tering problems, including imaging, and progress is still being made. For 
many years we have preferred to use techniques based on the singular value 
decomposition (SVD) of the kernel as they provide the most soundly based 
mathematical description of the effects of linear compact operators. The SVD 
method has been extended recently with some quite spectacular new results 
obtained using the theory of mathematical programming. These first prove 
the existence of a unique positive primal solution of minimum norm 
which preserves the measured singular function coefficients and then, under 
very general conditions, also prove the existence of a dual problem with a 
unique optimum. 

We have implemented these results in PCS and will show that they provide 
a powerful new approach to the inversion problem. 

2 The Basic Physics 



That the thermal diffusive Brownian movement of microscopic particles could 
be studied by coherent light scattering first seems to have been noted by Sir 
C V Raman. In 1919 [5] he had briefly reported the observation of the effect 
which we now call speckle, the random granular pattern seen in coherent 
light that has been scattered from a rough surface. In 1943, using a layer 
of lycopodium powder on a glass slide, in both plane and spherical wave- 
fronts Ramachandran [6] first correctly formulated and verified the basics of 
speckle-pattern formation. By visual classification of the scattered light pat- 
terns he demonstrated the exponential law of Rayleigh [7] for the distribution 
of intensity from a large number of interfering, randomly-phased oscillators. 
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In the conclusion to his paper he says that Raman had suggested that Brow- 
nian motion might be studied by observing the fluctuations of these speckle 
patterns and that he was proposing to study this possibility experimentally. 

Unfortunately, there seems to be no record of such further work from him. 
There is, however, a description of an experiment by Raman himself in his 
1959 Lectures in Physical Optics [8]. He viewed a thin film of milk illuminated 
by a mercury lamp behind a very small aperture and says “Bright points 
of illumination continually appear in the held and others disappear. These 
changes become less rapid and ultimately stop when the film is dry” . These 
fluctuations of a moving speckle pattern from sub-micron particles may be 
considered as a beating together of the components of the light held scattered 
by each individual particle. 

The basic theory of “light beating” was first given by Gorelik [9] and 
independently by Forrester et al. [10], both in 1947, and such experiments 
were performed by Forrester et al. [11] and by Hanbury Brown and Twiss 
[12] in the mid-fifties. 

Cummins et al. 1964 [13] first obtained quantitative information about 
the Brownian motion of suspended dispersed scatterers, and thus about their 
sizes. They scattered a red beam from a Helium-Neon laser off a suspen- 
sion of polystyrene spheres. They heterodyned the scattered signal with a 
“reference” signal split off from the same laser beam which was shifted in fre- 
quency by passing through an ultrasonic “Bragg cell” . This is the principle 
used to detect radio and television signals by beating them against a fre- 
quency shifted “local oscillator” in a non-linear detector. They measured the 
frequency spectrum of their photomultiplier detector output current (in the 
KHz range) with a wave analyser. As we shall see later, for a monodisperse 
suspension of spherical macromolecular particles the amplitude spectrum of 
the fluctuating optical speckle pattern is predicted to be of Lorentzian profile 
and in this experiment this spectrum is seen directly in the fluctuations of the 
photocurrent. The smaller the particles, the faster is the Brownian motion, 
the faster the intensity fluctuates and the broader the linewidth becomes. 
The spectral linewidth thus gives a measure of the particle size. 

At the time of the first experiments of Cummins et al. on particle scatte- 
ring, the coherence of the laser was being used for other light-beating studies 
in the laboratory of Benedek at MIT where, for example, very beautiful ex- 
periments were performed on the critical opalescent fluctuations of pure SFq 
[14], experiments being refined even to this day to study critical fluctuations 
in xenon in the microgravity environment of the US Space Shuttle [15]. As 
was done by Raman, in these experiments Ford and Benedek observed the 
fluctuations of the scattered light held without the addition of the reference 
beam. 

For a dilute particle suspension, where the particles are spaced at sepa- 
rations of many optical wavelengths, each particle can be considered to be 
one of Rayleigh’s randomly phased model oscillator sources; in the common 
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situation where there are a large number of particles in the observed scat- 
tering volume Rayleigh’s exponential distribution of detected intensities is 
accurately observed. 

Such a “self-beating” experiment on biologically interesting molecules was 
first published by Dubin et al. [16], who made measurements of bovine serum 
albumin, ovalbumin, lysozyme and also polystyrene latex spheres. They also 
studied tobacco mosaic virus and DNA and found that the non-sphericity of 
these molecules leads to deviations from the Lorentzian spectrum. A theo- 
retical analysis by Pecora [17], which predicts angular-dependent, multi- 
Lorentzian spectra, can be used to obtain shape information of such molecu- 
les. 

The field is thus sometimes called light-beating spectroscopy or intensity 
fluctuation spectroscopy. It also goes by the name of dynamic light scattering. 
In the absence of a reference beam it is called “self-beating” spectroscopy 
and also still quite commonly (although more problematical since it conflicts 
with heterodyning with a reference beam at baseband in radio and microwave 
terminology) “homodyne” spectroscopy. True homodyne spectroscopy in the 
radio-frequency sense, i.e. heterodyning with an unshifted reference beam, is 
also perfectly feasible in laser scattering (see Jakeman et al. [18]) and (as also 
does heterodyning) gives an improved signal-to-noise ratio of almost an order 
of magnitude over the self-beating method since only the signal beam out of 
the beating pair of signals fluctuates. However, the difficulties in the optical 
region of aligning wavefronts to the required wavelength-scale interferometric 
accuracy at varying scattering angles has precluded the widespread utilisation 
of this advantage and the self-beating method has become standard. 

3 Experimental Methods and the Photon Correlation 
Function 

The measurement of particle size from a few nanometres to a few microns 
in diameter by photon correlation spectroscopy is now a rather mature held 
with a number of commercial companies offering equipment and software 
packages. Nevertheless, experimental methods do continue to evolve. 

The first instruments were marketed from 1971 by the Malvern (UK) 
company Precision Devices and Systems (now Malvern Instruments). At the 
centre of the set up was a goniometer, which carried a photomultiplier tube 
assembly and collection optics on a rotating arm, which could collect light at 
angles between about 20 and 160 degrees. The photomultipliers were able to 
count individual photon detections. 

The sample was contained in a 1 cm-diameter cylindrical cell in an index- 
matching bath on the axis of rotation. A thermostatted housing kept the 
sample and bath at a constant temperature. The laser, mounted on a fixed 
second arm, was usually helium-neon of about 3 — 5 mW, but for very dilute 
samples of low molecular weight an argon-ion laser was used which could 
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supply typically a few hundred mW. The laser beam was focussed down into 
the cell to a few hundred microns diameter with a simple lens. 

In some PCS instruments a fixed scattering angle of 45° , 60° , 90° or 135° 
is used; this is sufficient for spherical particles in the Rayleigh scattering 
region (^r << 1), where A is the laser wavelength and r the particle radius, 
since the angular dependence of the scattered intensity is then negligible. 

For the type of (stationary) optical signals encountered in light scattering 
from macromolecular suspensions the spectrum of the optical-field fluctuati- 
ons is given by the Fourier cosine transformation of its autocorrelation fun- 
ction (the Wiener-Khinchine theorem). The “analogue” anode photocurrent 
is, in fact, the superposition of the amplified individual photoelectron emis- 
sions from the tube cathode plus lower-level noise from the tube base and 
its associated circuitry. By using a discrimination threshold the latter com- 
ponents can be rejected and, with sufficiently fast circuits so that the pulses 
do not overlap, a replica of the digital pulse train of photon detections at 
the cathode can be output in standardised pulses, each typically of the order 
of a few volts in amplitude and some tens of nanoseconds in width. In the 
first “Malvern Correlators” these pulses then went forward to a 50ns digital 
correlator. This is sufficiently fast for most particle sizing applications and 
most current systems, in fact, are actually slower. 

The autocorrelation function of the detected pulse train is called the di- 
gital photon correlation function and is defined by 



+ Ti) ( 1 ) 

3 

where n{tj) is the number of photon detections registered in the interval tj 
to tj -I- T, T is a sampling time chosen to be short compared to the time 
scale of the fluctuations and Ti is the delay time of the fth sample. The 
superscript (2) refers to the fact that the correlation function is describing 
the fluctuations of the intensity I(t) (photon arrival rate), which by quantum 
detection theory can be shown to be given by the modulus of the square 
of the electromagnetic field. This function is a good approximation to the 
continuous intensity correlation function f < I(0 )I(t) > dr. GG^(t) is more 
conveniently written in the normalised form 5 ^^^(r), where 

jra(r) = Co„st|Ff^_l| + l. (2) 

The constant, which will be greater than 1, depends upon optical aperture 
sizes, in particular the size of the detector pinhole, but these should be arran- 
ged so that the constant is as close to 1 as possible consistent with having, 
say 10 or more photon detections in the mean decay time of the correla- 
tion function. In practice, G^^^(oo) is estimated at a long delay time where 
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the fluctuations are independent; it measures the square of the mean inten- 
sity. Since the zero-delay channel is not normally measured, it is difficult to 
normalise accurately to its theoretical value of 2 at the origin and the 
normalisation constant is therefore carried along but does not affect the time 
dependence of the functions, which is used to And the particle sizes. 

Rayleigh’s exponential law for scattered intensities arises from a gaussian 
law for the scattered amplitudes and this, together with the stationarity 
property, leads to a relation between and the autocorrelation function, 
of the optical held, Siegert [19]: 

g^^Hr) = l + \[g^^\r)]f- (3) 

Miniaturisation of electronic components have made possible quite specta- 
cular changes in the size and performance of these instruments. Most photon 
correlators are now available in the form of plug-in boards for personal com- 
puters and one, designed by the first author and his group at King’s College, 
London, even runs very quickly as a software application on a general pur- 
pose CPU. This gives more flexibility than does dedicated hardware. More 
recently miniature photomultiplier tubes and solid-state avalanche photo- 
diodes (APDs) have been used [20]. The same is true on the optical side, 
following work by R G W Brown at RSRE Malvern in the late 80’s [21,22]. 
Modular systems using semiconductor lasers, solid-state avalanche photodi- 
odes and single-mode optical fibres are in use today in space-shuttle expe- 
riments [23] and are also available in some recent commercial instruments, 
where there are particular requirements for very high sensitivity. 

4 Mean Particle Diameter, Polydispersity, and 
Higher-Order Moments 

The data presented in the photon correlation function are a set of values 
of at delay times r^, which may be spaced linearly: n= T, 2T, 

3T, 4T, ... , or, more soundly, as we shall explain later, geometrically, 
Ti = T, (5r, i5^r, 6^T, . . . , where S is called the “dilation factor”. The sample 
time, T, is chosen so that, on a logarithmic time scale, (rd varies little at 
both low and high values of delay time so that the “action”, as it were, takes 
place well within the delay times covered. To determine the mean particle dia- 
meter and the polydispersity of a distribution of spherical or near-spherical 
particles we must first realise that the raw data give information about par- 
ticle sizes only by relating the first-order correlation function g^^\r) to their 
translational diffusion coefficients, Dt- We firstly utilise the Siegert relation 
of equations (2) and (3) to deduce g^^^ from the measured G^^^ (to within 
a constant factor) . The primary relations of the particle-sizing technique are 
then that for scattering from dilute monodisperse particle suspensions at an 
angle 0 from the forward direction we have the single-exponential relation 

g(i)(Ti) = exp{-K‘^DTTi) (4) 
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where K is the magnitude, ^sin^(|), of the scattering vector, and that 
for dilute polydisperse suspensions with distributions P(Dj’) of translational 
diffusion coefficient we have the linear superposition of exponentials, one for 
each Dt- 

9^^\Ti)= / P{DT)exp{-K'^DTTi)dDT, (5) 

J Dt {min) 

where we have cut off the distribution at suitable upper and lower limits 
according to a priori knowledge available to the experimenter. It is convenient 
to denote K^Dt by P so that this equation becomes 

pr{max) 

= / P{P)exp{-PTi)dP. (6) 

J r{min) 

To determine the mean value < P > (first moment) and the T-polydispersity 
(second central moment or variance), 

9^2 = J Pimr- < p >fdP, (7) 



it suffices to fit the normalised correlation function to an expansion in these 
first two moments of the form 




[G(2)(oo)] / 



= Constant— < T > r + — ^2 



(8) 



This is a simple and accurate (and widely used) method for recovering these 
parameters of the distribution. The scattering cross-section of a macromole- 
cule in the Rayleigh region varies as the square of its molecular weight. Mi, 
so that for a polydisperse suspension the mean value of < T >/K'^ gives a 
weighted average of diffusion coefficients, '^CiMfDi/'^aMf, where Ci is 
the concentration of molecules of molecular weight Mi. This is called the “z- 
average” molecular weight, Mz, and is, happily, just the average needed in 
the Svedberg equation (assuming constant partial specific volumes) to give 
the weight-average molecular weight, M^,. For narrow distributions the value 
of H 2 1 < P can be related to polydispersity indices such as M^/M^ by 
assuming model forms, such as random coils, for the scatterers [24] . 

The link with particle size is then made by using the Stokes-Einstein 
formula 



Df 



kT 
Girrir ’ 



(9) 



where k is Boltzmann’s constant, T is the absolute temperature, rj is the 
solvent viscosity and r is an “equivalent spherical radius”, that is to say it 
is the actual radius if the particle is a sphere but an estimated measure of 
equivalent size if not. 
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Equation (8) gives the first two terms of an expansion of the correlation 
function in a series of cumulants first derived by Koppel [25] 

°° t — T-i™ 

ln|5(i)(r)| = (10) 

ml 

m—l 

where is the cumulant (semi-invariant) of P{r). In principle, the- 
refore, the higher-order cumulants can be found by fitting ln|g^^^(T)| to a 
polynomial function. The first three cumulants of a distribution are identical 
to its first three central moments but the fourth and higher cumulants are 
certain combinations of higher-order moments. Since, in practice, these can- 
not be found accurately from experimental data, for reasons which we shall 
explain in the next section, we will not discuss this method further here. 

5 Singular Value Decomposition and Exponential 
Sampling 

The reason for the lack of success of the cumulant expansion to investigate 
properties of the distribution beyond its variance are very subtle and have 
only been understood more recently in terms of a theory of information [26] , 
based on the singular value decomposition of a compact operator, which 
generalises that of Shannon [27]. In simple terms, it turns out that there is a 
unique expansion for the distribution in which each successive-order term is 
protected as best possible against the inevitable noise present. This is called 
a “singular-function” expansion. (Mathematically speaking any expansion or 
inversion method would do if there were no noise). The cumulant expansion is 
thus not the best way to recover higher-order polydispersity information and 
one can say the same of any other ad hoc recipes for this inversion problem. 

The singular-function expansion is determined by analysing (6), which is 
a Fredholm equation of the first kind involving a compact integral operator, 
and which suffers, as do all such equations, from a problem known as ill- 
posedness, or in its discretised form ill-conditioning. This can be understood 
by considering the exponential function in the integrand, the kernel, as a 
smoothing function for the distribution P{r). If P{P) has any fine oscillatory 
structure it will not contribute strongly to the integral since multiplication 
by the smoother exponential function and integration will smooth it out. 
The mathematical expression of this phenomenon is the Riemann-Lebesgue 
theorem 

pOO 

lim / sm{nP)K{r,T) dP = 0, (11) 

Jo 

where K(P,t) is any integrable function of P. 

Each Fredholm equation of the same type as (6) has its own unique “best” 
expansion series, determined by its particular kernel function. These series 
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are analogous to Fourier series expansions in that successive terms have finer 
and finer structure and their coefficients are easily found by projection since 
each function is orthogonal to all the others, i.e. the functions in terms of 
which the solution is to be expanded form a complete orthonormal set. 

The other generalisation from Fourier analysis is that two such “basis” 
sets of functions are required, one, u„(r), in which to expand the data, 
and one, m„(F), in which to expand the the solution, P(F); n = 0, 1, 2, 3, . . . . 
The integral equation transforms each orthogonal component of P{r) into 
the corresponding component of 



a„u„(r) 



J Un{r) exp{~rT)dr. 



( 12 ) 



where is a real constant called the nth singular value and which determines 
the strength at which the nth component is “transmitted” by the experiment 
into the data values. Thus if the diffusion-coefficient distribution were to be 
given by the function u„(T) the correlation function would be a„n„(T). The 
values of decrease to zero as n — >■ oo. Of course, all the except uq 
have negative regions so a positive- valued combination of the Un is needed to 
correspond to a real physical distribution. 

The Fredholm equation of our problem (6), is a finite Laplace transform 
and with a suitable discretisation in T, which can be fine enough to avoid 
any need for interpolation, its “singular system” comprising its singular func- 
tions, Un{r) and Vnir), and singular values, a„, can be found using standard 
numerical routines to be found in any comprehensive mathematical software 
library. 

The full Laplace transform does not correspond to a compact operator 
and so does not possess a singular system. One can, however, find a non- 
countable set of eigenfunctions for the full Laplace transform and these were 
first found in a closed analytic form by McWhirter and Pike in 1978 [28]. The 
eigenfunctions have the form of cos(log) functions and the eigenvalues were 
found to be given by 



““ " \l coshVu;) ’ 

where the integer order index n in the singular system is replaced by the 
continuous index w (as happens in the transition from Fourier series to Fourier 
integrals). 

The eigenvalues show a rapid exponential decrease in the strength at 
which higher-order functions contribute to the correlation function and the 
same effect occurs, of course, in the finite-transform case. Given the singular 
system of the equation, the mathematical solution to the inverse problem 
posed by (6) is given in an analogous way to Fourier analysis by 



Pin /5^^H'r)n(r)dT 

P{r) = 2^ m„(T). 



n—0 






(14) 
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Division of the expansion coefficients by smaller and smaller values of 
a„ must be stopped and the series truncated or tapered off in order not to 
include amplified noise in the reconstruction. In practice, with the order of 
noise achievable in real experiments, it turns out that only a small number 
of “visible” independent components can be reliably recovered in the data- 
analysis process. The resulting solution we refer to as the truncated singular 
value decomposition solution, or TSVD solution for short. 

Using the analogy with the sampling theory of Fourier analysis and the 
geometrically spaced zeros of the eigenfunctions found by McWhirter and 
Pike, Ostrowsky et al. [29] proposed an “exponential-sampling” method of 
data reduction in which the solution is reconstructed at geometrically spaced 
r values within the range chosen. Due to the ill-conditioning, only a small 
number of sample values can be used and an interpolation, equivalent to the 
sin(a;)/a: of Fourier sampling theory, but now stretched out exponentially, is 
used between the points. Calculation of the first two moments, as described 
above, is first performed in order to choose a suitable range of F for sampling. 

The sampling and interpolation theory of the full Laplace transform inver- 
sion problem can, in fact, be formally mapped onto the Whittaker-Shannon 
theory of the Fourier transform inversion problem by geometric dilation of 
the variables and this is the reason for the use of geometrical spacings for the 
sampling points of both the data and the solution in this problem. Just as in 
the Fourier-transform case, there is no further information to be found within 
the selected bandwidth between such sample points. The use of such geome- 
tric sample spacings reduces both hardware requirements and data-reduction 
times and has been perhaps one of the most significant developments in PCS 
particle sizing since the method was first introduced. 

6 Removal of the Background 

A more recent development in the inversion of the correlation function has 
been the proposal for an improved method [4], initiated by Ross et al. [3], for 
handling the reduction of the measured data which avoids taking the 
explicit square root in the Siegert relation (3). 

The new method uses the fahltung theorem for the Laplace transform as 
follows. 

cOO 

\g(^){r)\^ = \ P(T)exp(-rr)drp 
^0 



cOO cOO 

/ P{x) exp(— xr) dx / P{y) exp(— yr) dy 
Jo Jo 



which, by the change of variable x y = z 

cOO pOO 



cOO cOO 

/ / P{y)P{z-y)exp{-ZT)dydz 

Jy Jo 
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= / P^‘^'>{z)exp{-zT)dz (15) 

Jo 

where 

/ oo 

P{y)P{z - y) dy 

-OO 

i.e. the inverse Laplace transform of the data minus the background is the 
convolution of the distribution with itself. To obtain this result we have ex- 
tended the limits of the integrals by using the facts that P(t/) = 0 for y < 0 
and P{z — y) = 0 for z < y. The authors of [3] now find the distribution fun- 
ction by deconvolving P^“^\ They accomplished this, in fact, by a non-linear 
regressive fitting of a six-point histogram for P{P). They are not explicit, 
however, about the treatment of the background. One presumes that some 
effort was made to guess it and subtract it from the data before the Laplace 
inversion was performed. Let us assume that the residual error in the back- 
ground estimate is e whichever method is used. Since the estimate of 
typically will fall monotonically to e while that of will fall monotonically 
to the relative residual error in the transformed data will be much less in 
the first case than in the second. For example a background in error by one 
per cent in the estimate of turns into a background error in the estimate 
of g^^'^ of ten percent. One also gains a bit more since the support of the 
transform in the first case is twice that of the second which aids accuracy. 

To investigate further we note that the deconvolution can be done by 
using the fahltung theorem for Laplace transforms, namely, 

POO 

C P{y)P{z-y)dy = CP{y)CP{y) (16) 



where C denotes the Laplace transform. Thus: 



P{P) = C-^{Jc{C-^g(^)-e]}. 



We see that the particle-size distribution is the inverse Laplace transform of 
the square root of the Laplace transform of the inverse Laplace transform of 
the data minus its estimated background! We seem to have gone around in 
a circle since, by cancelling out the two transforms under the square root, 
this is nothing but the original method! The fact that the square root of the 
background error comes back into the calculation again merely shows that the 
deconvolution of is also an ill-conditioned problem. Since, however, the 
method by which one performs the deconvolution should not obviously affect 
the result we have failed to find the advantage claimed for the new method. 
One can hardly dispute the experimental results which show a significant 
improvement in accuracy and we explain the improvement by the fact that. 
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in addition to the more advantageous support for the transform, the circle 
has been broken implicitly after the direct inversion of the data by ignoring 
the part due to the background, outside the estimated support of 

the convolution. 

Our first step in the reduction of this experimental data uses this new me- 
thod of direct inversion of using singular value decomposition, followed 
by the deconvolution of described above, implemented in our case by the 
fast Fourier transform. The SVD method is linear and uses a precomputed 
singular system of the kernel. 

We will call our version of the method of Ross et al. the TSVD(2) method. 

7 The Primal Method of Mathematical Programming 

The TSVD solutions, although fast and within the basic “information con- 
tent” of the data, do not make use of the fact that the solution is a distribution 
function and thus is known to be positive. 

We discuss two methods for incorporating the prior knowledge that the 
solution is positive into the TSVD methods. The methods are based on ma- 
thematical programming techniques. One method can be viewed as a primal 
method [30] and the other, which we will discuss in the next section, as its 
dual [31]. Provided the singular functions are analytic these methods both 
deliver the same solution, namely the positive solution of minimum 2-norm 
which agrees with the truncated singular function expansion in its known 
terms. That such a solution exists and is unique has been proved in [32]. As 
might be expected, due to the use of the extra information this solution also 
appears to possess higher resolution. In the presence of noise both methods 
can sometimes fail to converge and in these situations there are simple re- 
medies which yield approximate solutions. While there may be no reason to 
suppose the unknown object should be that of minimum 2-norm, our method 
has the advantage over other non-linear methods that for a noiseless band- 
limited object the exact solution is delivered and also, in a certain sense, it is 
that positive solution which adds the minimum to the known TSVD solution. 

The determination of this solution is a standard mathematical program- 
ming problem. Let us introduce the operator A : L‘^{r{min),r{max)) — >■ 

^JV+l 

given by 

pr{max) 

{Af)i= / f{r)u,{r)dr z = o,...,fv (is) 

J r{min) 

The equality constraints for the solution we seek then take the form 



{Af)i = d. 



i = 0,...,7V 



(19) 
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where di is determined from the data via 



/ g^^^T)Vn{T)dT 



(20) 



The primal problem, which we have described above, is then given by 



min ll/IU 

f^L'^{r{min),r{max)),f>0,Af—d. 



( 21 ) 



In order to solve the primal problem we note that, on discretisation, it 
takes the form of a quadratic programming problem given by: 



min -F(a) = ||Da||^ 



( 22 ) 



subject to the constraints 

la < a < Ua and If < Ca < Uf (23) 

where the free variables {a„} are the coefficients of the singular functions in 
the solution we seek. Matrix D generates the “cost” to be minimised, based 
on the current value of the variables, a. Matrix C is a constraint matrix, 
specifying constraints on linear combinations of the variables. In general C 
and D are different, but in our problem we shall see that they take the same 
form. 

When the solution is discretised to Nf points we have Nf singular fun- 
ctions and (22) and (23) describe a quadratic programming problem, where 
the objective function, F, depends on the Nf free variables {of}. 

The first of the linear constraints, la < a < Ua, concerns the values that 
the free variables are allowed to take. From our preliminary analysis the first 
-I- 1 of these values are already known, and so the first -|- 1 elements of 
la and Ua are both set to this calculated value to give an equality constraint. 
Many inverse problems exhibit a sharp fall off in their singular value spec- 
trum. For this reason we assume that nothing is known about the for i 
beyond the fall-off point and so the remaining elements of la and Ua are set 
to — oo and oo respectively. 

The second set of constraints, involving the constraint matrix C, are used 
to impose positivity on the solution. Let f denote the discretised positive 
solution we seek, f{i) represent the component of f and Uk{i) the 
component of u^. Then: 



Nf-l 

m = E ak-Uk{i) (1 < i < Nf) 



( 24 ) 
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Introducing the positivity constraint, 0 < f{i) < oo, at the ith point then 
leads to: 



Nf-l 

0 < E ak-Uk{i) < oo (25) 

fc— 0 

Hence in the second of the constraints in (23) we have If = 0 and Uf = oo 
where the constraint matrix, C, is given by Cij = Uj{i). The optimisation 
matrix D in (22) that ensures f is the minimum 2-norm solution can be 
derived in a similar manner to this, and also takes the form Dij = Uj{i). 

It may be observed that the dimensionality of the problems set out in 
equations (22) and (23) may be reduced to Nf — {N + 1) free variables. By 
recognising that the first -|- 1 variables must remain equal to the value 
specified by the TSVD expansion, and forming matrices D and C in the 
same way as before, the problem becomes 



min i^(a) = ||f + Da|p (26) 

subject to the constraints 

la < a < Ua and If < Ca < Uf (27) 

where now 1/ = — f. Now, instead of {ofc} generating the entire reconstruc- 
tion, it merely generates the extra portion that ensures positivity. This is then 
added to the TSVD expansion, f. Making this simplification clearly shows 
the aim of the method; there is an in-band component forming the TSVD 
expansion plus an out-of-band component that is added to ensure positivity. 
However, sometimes it can be beneficial to allow the first iV -|- 1 values of 
Ofc to become perturbed slightly from their calculated values. For this rea- 
son, the slightly more complex formulation in (22) and (23) has been used in 
subsequent calculations. 

The actual quadratic programming method used to solve this problem was 
E04NCF of version 15 of the Fortran NAG library. E04NCF is essentially the 
same as the better known routine, LSSOL, described by Gill et al. [33]. The 
method has two phases, the first of which is to find a feasible point, and 
the second of which is to minimise the objective function over the feasible 
set. Once an iterate is feasible, all subsequent iterates remain feasible. The 
conditioning of the problem may be improved by rescaling the variables so 
they are all of similar Euclidean size at the optimal point. This is usually 
difficult to arrange, and was not done here. We will give examples later but 
first we will present another method which can be much more efficient. Recall 
that our solution agrees in the first -|- 1 terms of its singular function 
expansion with the TSVD solution. The solution is chosen, from the wide 
range of positive solutions, to be the one of minimum 2-norm which satisfies 
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the constraints. Note that if the TSVD solution should turn out to be non- 
negative then this solution will coincide with it. This is a huge advantage 
over other norm choices and other methods for imposing positivity. If the 
object were to lie entirely within the bandlimit of the system then our way of 
imposing positivity would not alter the TSVD solution, since that is already 
the minimum 2-norm solution for the case where the first 7V-I-1 coefficients are 
specified. Adding in amounts of higher-order singular functions must increase 
the norm since by Parseval’s Theorem the 2-norm is given by the sum of 
the squares of the coefficients in the singular function expansion. There is 
no reason to suppose that other methods will deliver the TSVD solution 
if it is non-negative. Note, however, that in the absence of additional prior 
information there is no reason to suppose that the minimum 2-norm solution 
is any more likely to occur than any other one for which the first -|- 1 
singular function coefficients have the same values. Another important point 
to note is that in the presence of noise there can be some (noisy) values of the 
first few singular function coefficients for which the problem does not have a 
solution. 



8 The Dual Problem and Its Solution 

In many cases a primal quadratic programming problem has an equivalent 
“dual” problem. The dual problem is finite-dimensional and has the advan- 
tage of having a small number (iV -|- 1) of free variables. The relevant form of 
duality is Fenchel (or conjugate) duality. For a detailed discussion see [34]. 
The dual problem is given by [35], p. 62) 



2 ^ ' 

1=0 

where the vector A contains the dual variables and the -I- subscript indicates 
that the positive part of the term in brackets must be taken. We recall that 
the positive part of a function f is defined by 

(/)+ = /(a;), /(a:) > 0 

(/)+=0, /(x)<0 (29) 

The transpose — >■ L?' {F {min) , F {max)) of A (as defined in 

(18)) is given by 



N 

{A^li){F) = '^u,{F)fii, 
z=0 



N+1 



ji ^ R 



(30) 
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Hence the dual problem may be written as 
AS-R^+i 2 . 

At the minimum the gradient of this function must be zero. The gradient 
is given by [36] 

-d + A{A^X)+ (32) 



So, at the minimum, 



-d+ A(A'^A)+ = 0 (33) 

The Hessian is given by (again see [36]) 

= J Ui{t)uj{t)dt 0 < i,j < N (34) 

i:(A^A)(t)>0 

SO that the gradient may be written as 

g{\) = -d+H{\)X (35) 

and equation (33) becomes 

H{X)X = d (36) 

Let A°P‘ be the solution to the dual problem. Then the solution to the 
primal problem is given by 



N 

( 37 ) 

i=0 

(see [35], p. 61, (7.2) with q = 2). Note that we must take a positive part to 
get this solution. 

A unique solution for the dual problem is guaranteed if the singular func- 
tions are so-called “pseudo-Haar” . This is true if they are analytic functions 
which is the case here. 

It is clear that the singular functions will be analytic if the kernel of the 
problem is analytic. As a consequence the dual method is applicable to a 
wide range of linear inverse problems. 

A proof that the pseudo-Haar property guarantees uniqueness of the dual 
optimum may be found on p. 62 of [35]. Key requirements for using the dual 
method are that the primal and dual optimal values must be equal, the dual 
optimum must be attained and the primal solution must be obtainable from 
the dual one. Various results in [35] and [37] applied to the problem discussed 
here may be used to show when these requirements will be met. 
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Turning now to the implementation of the dual method we note that the 
Hessian is positive semi-definite so that the dual objective function in (28) is 
convex. Furthermore, for the case when the singular functions are bounded 
and analytic it is simple to show that the dual objective function together 
with its gradient and Hessian are all continuous functions. As a consequence, 
for this case we can use a smooth optimisation method. For a start one can 
try to solve the dual problem using Newton’s method 

^nen, ^ yold _ (3g) 

implying 

H(^X°^d)yuew ^ jj^yld^y^old _ (gg) 

Using (35) this becomes 

H(^yld)ynew ^ ^ 

The iteration is terminated when the gradient is sufficiently small since we 
know that the gradient must be zero at the optimum. As is well known, 
Newton’s method is not guaranteed to converge unless the function being 
minimised is quadratic. However, if one starts sufficiently close to the opti- 
mum the method converges quadratically. 

We have implemented alternative methods possessing better convergence 
properties such as the Broyden, Fletcher, Goldfarb and Shanno and Davi- 
don, Fletcher and Powell quasi-Newton methods with cubic polynomial line 
searches. Again these are terminated when the gradient becomes sufficiently 
small. One could also try other conjugate direction techniques such as con- 
jugate gradient methods but we have chosen not to do this since the quasi- 
Newton methods are generally more robust. 

There are standard convergence results for all these algorithms. However, 
the results rely on the starting point being sufficiently close to the optimum 
and for a given starting point this is a difficult condition to check. If one insists 
on global convergence results before using an algorithm then the Levenberg- 
Marquardt method, detailed in [38], pp. 312-313, can be used. This is a 
modification of Newton’s method in which a small multiple of the identity 
matrix is added to the Hessian for those steps where it is ill-conditioned. 
This becomes important when noise is present. In the present work (40) was 
solved using SVD. 

9 Results 

To proceed further we have written a computer programme to simulate 
particle-size data which consists of a small number of narrow peaks with 
selectable positions, amplitudes and widths. The photon correlation function 
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can have selected values of G^^^(O) and has Poisson noise due to the photon 
statistics. 

In Fig. 1 we show results for simulated distributions consisting of two 
equal-height peaks, of equal relative widths of 10% of the mean, spaced at 
dilation ratios 1.6, 2.0, 2.4 and 2.8. The Poisson noise corresponded to 3.10^ 
counts in the first delay channel of the correlation function which would be the 
case for a typical experiment of approximately 30 seconds duration. We show 
both the TSVD(2) results using the new background subtraction method, in 
which the inversion is performed without first taking the square root, and the 
results using the dual programming method. In the simulations we started 
the dual method with the TSVD(2) solution as an initial estimate. At low 
dilation ratios a typical “pushing apart” of the peaks is apparent together 
with some negative regions with the TSVD(2) method. We have found that 
the TSVD(2) method almost always gives a significantly better result than 
the original TSVD method and the dual method is even better again. The 
peaks are closer to their correct positions, the relative heights are correct and 
the solutions are data-consistent. 

The primal and dual methods for calculating the minimum 2-norm po- 
sitive solution produce the same solution, but by very different algorithms. 
It is important to look at the computational complexity of each method to 
see when it becomes more efficient to use one as opposed to the other. The 
computational time for the primal method scales roughly as the number of 
free variables in the problem. This is equal to the number of points in the 
final discretised solution; for our problem we only used 21 so that the primal 
problem would not be too expensive. 

The most expensive step in the dual method is the solution of an IV -I- 1 x 
IV -|- 1 system of equations, where IV -|- 1 is the number of singular functions 
used in the truncated singular function expansion solution. In our case we 
used 7. This procedure scales as {N + 1)^ (in the routine to find the LU 
factorisation of the matrix H). For a one-dimensional primal problem with 
-|- 1 equality constraints the Newton iteration involved in solving the dual 
problem typically converges within 3(iV-|-l) steps. It should also be reiterated 
that the dual method manages to solve the large dimensional primal problem 
with a significantly smaller number of variables. 

So, for a problem where there are few known components in the trun- 
cated singular function expansion and a large number of points in the final 
reconstruction is desired, the dual method should be used in preference to 
the primal. This will almost always be the case and is particularly true for 
the PCS inversion problem. However, if one knows a large number of singular 
function coefficients accurately, it may be that the primal method would be 
faster than the dual. It should be remembered, though, that an SVD analysis 
is unlikely to be used in the case of large values of IV -I- 1 since the reason 
for using singular function methods is to overcome the ill-posedness in the 
problem, and if -I- 1 is large the problem is not very ill-posed. 
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Fig. 1. Inversion results for the dual and the TSVD(2) methods plotted for distri- 
butions with two equal-height peaks at dilations (a) 1.6, (b) 2.0, (c) 2.4 and (d) 
2.8. Noise is Poisson corresponding to = 3.10^. 

10 Conclusions 

We have discussed the problem of deducing the particle size distribution by 
the numerical inversion of the data obtained in a photon correlation spec- 
troscopy experiment. The basic physics and experimental techniques were 
reviewed and the ill-posed nature of the inversion problem was described. 
The inversion technique using singular value decomposition was outlined and 
recent theorems from the mathematical literature were employed to extend 
this method to obtain a positive solution while ensuring the preservation of 
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the original singular function expansion coefficients. The results of a simu- 
lated experiment showed a considerable improvement in the fidelity of the 
reconstruction although, as in all such inversion problems, this performance 
cannot be guaranteed for any arbitrary object function. 
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Abstract. The measurement of Mueller matrices provides a useful way of increa- 
sing the information obtainable from scattering experiments. Nevertheless, the ana- 
lysis of this information is not a simple matter because of the complicated relations- 
hips between the elements of these matrices. This chapter deals with the properties 
of Mueller matrices and with the analysis of physical parameters measurable by 
polarimetric techniques. First, the case of scattering systems that do not reduce 
the degree of polarization of totally-polarized incident light is studied. The “pure 
Mueller matrices” corresponding to these kinds of system are studied and classified 
by means of their polar decomposition. The general structure of Mueller matrices 
is then studied and is applied to the “parallel decomposition” of the scattering sy- 
stem. A comparison of Mueller matrices with “Stokes matrices” , the Transmittance 
Condition, a Purity Criterion, the Degree of Purity and Polarizance parameters are 
also dealt with. 



1 Introduction 

Polarimetric techniques are useful in order to increase the information ob- 
tained in scattering measurements. The effects of a scattering system in the 
polarization of light beams that interact with it can be expressed mathemati- 
cally by means of the so-called Mueller matrix (or “scattering matrix”). The 
polarization state of a light beam can be represented by means of the Stokes 
vector, constituted by the four real Stokes parameters. The Mueller matrix 
is a real 4x4 matrix that gives the linear transformation of the Stokes vector 
of the incident beam into the Stokes vector of the emerging beam. So, the 
Mueller matrix depends on the particular interaction conditions as well as 
the spectral characteristics of the incident light. 

Thus, the 16 elements of the Mueller matrix contain all the information 
obtainable by means of polarimetric measurements. These elements are not 
totally independent in the sense that they are restricted by several bilinear 
and quadratic inequalities. In certain important cases some of these inequali- 
ties becomes equalities and the effective number of free parameters is reduced 
[ 11 - 

In consequence, a high degree of knowledge of the relationships between 
the elements of a Mueller matrix is important for analyzing the information 
contained in such matrices obtained from experimental measurements. Muel- 
ler matrix measurements allow us to identify and analyze properties such as 
diattenuation, birefringence and depolarization. These properties are related 
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with the nature and distribution of the scatterers that constitute the whole 
scattering system [2,3]. 

From a theoretical point of view, the interest comes from the necessity 
of establishing models that fit with the differences in behavior exhibited by 
the scattering systems. The main problem basically consists of extracting 
and understanding the physical information contained in the elements of a 
Mueller matrix. 

This chapter deals with the properties of Mueller matrices and with the 
analysis of physical parameters measurable by polarimetric techniques. In 
Sect. 2, the case of scattering systems that do not reduce the degree of po- 
larization of totally-polarized incident light is studied. The “pure Mueller 
matrices” [4,5] corresponding to this kind of system are studied and classified 
in Sect. 3 by means of their polar decomposition. The general structure of 
Mueller matrices is studied in Sect. 4, and is then applied to the “parallel 
decomposition” of the scattering system (Sect. 5). Section 6 is devoted to cla- 
rifying the relationship between the set of physical Mueller matrices and the 
set of “Stokes matrices” mathematically defined as the matrices that trans- 
form any Stokes vector into another Stokes vector. The conditions derived 
from the restriction that the transmittance of the passive optical system is 
not higher than 1 are considered in Sect. 7. In Sect. 8 we deal with a “Purity 
Criterion” for Mueller matrices, as well as purity and polarizance parameters. 



2 Basic Transformation of Polarization 

First, let us consider the Jones formalism in order to represent the effects of 
a deterministic-nondepolarizing optical system. It should be noted that we 
use the term “deterministic-nondepolarizing” in the sense of totally polarized 
incident light always emerging totally polarized. This observation is impor- 
tant because the degree of polarization of a partially polarized light beam 
can decrease when it interacts with some kinds of deterministic optical sy- 
stems[6] (for example a diattenuator). Furthermore, there exist deterministic 
optical systems that can depolarize totally polarized polychromatic light [7], 
resulting in an emerging light beam that can not be described by a stable 
Jones vector, but by the corresponding Stokes vector. Thus, we can say that 
any deterministic-nondepolarizing optical system can be represented by its 
corresponding Jones matrix. This circumstance occurs in several phenomena 
such as scattering by a single particle or by a collection of identical particles 
[2]. Let us consider the analytical signal representations £i(t), £ 2 {t) 

£i{t) = Ei{t) + 

£2{t) = E2{t) + lE2{t) . ( 1 ) 

where Ei{t), E 2 {t) are the real components of the electric field, and Ei{t), 
E 2 {t) are its Hilbert transforms [8,9]. 
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It is well known that when light incides on a linear passive optical system, 
the transformation of the field state can be written as 



e'{t) = Ts{t), 



( 2 ) 



where 



s{t) = 





(3) 



are the emerging and incident Jones vectors and T is the Jones matrix of the 
system. 

In general both emerging and incident fields can fluctuate. Now we con- 
struct the polarization matrix [8], namely, the 2 x 2-covariance matrix of the 
two variables Si{t),e 2 {t) 



<l> = {s 1 ^) 



/ SiSl £ i £ 2 \ 

\^£2£j £2£2 / ’ 



(4) 



where the angular brackets denote time averaging, ® denotes direct (Kronecker) 
product, denotes the complex conjugate transpose of £, and e* denotes the 
complex conjugate of £j. 

The relation between (p and the corresponding Stokes parameters is given 
by [10,8] 



<l> = 




k=0 



1 / So + Si So -I- iS3 

2 V So - is3 So - Si 



(5) 



where ak{k = 1, 2, 3) are the Pauli matrices and cto is the identity matrix 




The intensity and the degree of polarization are invariant under transfor- 
mation of the reference axis, and thus have relevant physical meaning. They 
can be expressed as 



I = Tr^, 



(7) 



4det^ 

(Tr<l>)^ 



P = 



(8) 
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The polarization matrix of the emerging beam is 

<p' = {e' (g) e't) = ((Te) (g> (Te)^) = (Te 0 e1'Tl') = T(e 0 e1’)Tl' = . 

(9) 

In order to make subsequent straightforward calculations, we introduce 
the following alternative notation for the elements of ‘P (which we will also 
use, where convenient, for other 2x2 matrices) 



^0 = ‘fll, = Vl2 1 V2 = ^21, 



(/?3 = if22- 



( 10 ) 



The elements of the polarization matrix can also be written as a column- 
vector ip [11] defined by 



if = 



/ ‘Po\ 




( ei£i \ 


Ti 




£i£2 


T2 




£2£! 






\£2£2/ 



= (£ 0 £*)• 



( 11 ) 



The polarization vector ip and the corresponding Stokes vector s are re- 
lated by means of the following expression 



s = Aip, A = 



/I 0 


0 


1\ 


1 0 


0 


-1 


0 1 


1 


0 


1^0 i 


—i 


0/ 



(12) 



The polarization vector ip' of the emerging beam is given by 

if' = {s' 0 e'*) = {{Te) 0 (Te)*) = {{Te) ® (T*e*)), (13) 

which, applying the properties of the Kronecker product, can be written as 
= {{T 0 T*) {e 0 £*)) = (T 0 T*) ((£ 0 £*)) = (T 0 T*) <p, (14) 



or as 



ip' = Vif,V ={T(^T*), (15) 

where the deterministic-nondepolarizing optical system is represented by the 
“complex Mueller-Jones matrix” V [12]. The corresponding Mueller-Jones 
matrix N of the optical system is related to V and T by[12][13] 

N = AVA~^ = A{T(^T*)A-\ (16) 



or, in components 

nu = (akTaiT^) , (fc, / = 0, 1, 2, 3) . 



(17) 




4 Mueller Matrices 



67 



In the next section we will deal with optical systems whose effect on pola- 
rized light can not be represented by means of Jones matrices, but can be re- 
presented by Mueller matrices. Therefore we will distinguish between Mueller- 
Jones matrices (“pure Mueller matrices”), which correspond to deterministic- 
nondepolarizing optical systems, i. e., characterizable by Jones matrices, and 
Mueller matrices in general. For reasons of clarity, hereafter we will use the 
following notation for the different types of matrices: V, complex pure Mu- 
eller matrices; L, complex Mueller matrices; JV, pure Mueller matrices, and 
M, Mueller matrices. The justification for using “pure” to refer to certain 
sort of matrices will be clear from subsequent sections. 

It is now pertinent to consider the reversibility properties of the matrices 
that represent optical systems, i. e. the operation of interchanging the incident 
and emerging light beams. For a given optical system characterized by a Jones 
matrix T, the Jones matrix representing the same system when the incident 
and emerging beams are interchanged is [14] (absence of Faraday effects 
assumed ). For the complex Mueller-Jones matrix V(T) it is easy to prove 
that V^(T^) = V'^{T), and that the corresponding reverse Mueller-Jones 
matrix is given by[15] 

= N{T^) = QN'^Q, (18) 

where Q is the diagonal matrix ZJ(1,1,1,— 1) that performs the sign change 
of the element S3 in the reverse interaction. 



3 Polar Decomposition of Pure Matrices 



Let us now consider the polar decomposition of Jones matrices and pure 
Mueller matrices [16-18]. Any Jones matrix can be written as J = JpJr, 
or J = J^Jp, where Jp, Jp are Hermitian matrices and Jp, Jp are unitary 
matrices. 

In the Jones formalism, Hermitian matrices are associated with diattenua- 
tors (partial polarizers). These matrices can be written as 



Jp (a, S, ki, k2) 



f {ki-k2)caSae 

V (^1 - ^2) kisl + /C2C^ J 

(ca = COS a, Sa = sin a) . 



(19) 



ki,k 2 (fci > /C2) are the principal coefficients of amplitude transmission for 
the two orthogonal polarization eigenstates of Jp. Their azimuths (namely y 
and X J- f ) and their ellipticities (w and ) are given by 



tan 2y = tan 2a cos 6, 
sin 2oj = sin 2a sin S. 



(20) 
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This kind of Jones matrix satisfies det Jp = kik 2 - 

Interesting specific cases are: <5 = 0, linear diattenuator (linear partial 
polarizer); k 2 = 0, total polarizer. 

The unitary Jones matrices are associated with pure retarders and can be 
written as 






y e*'>'s/3C/3 s^e*T+c|e“*T j 

(c/3 = cos/9, S /3 = sin/3). (21) 



A stands for the retardation caused by the retarder between the two 
elliptical polarized eigenstates of Jp. The respective azimuths jp, ip + ^, and 
ellipticities {p, —v) of these eigenstates are given by 



tan 2^ = tan 2/3 cos 7, 

sin 2v = sin 2/3 sin 7. (22) 

Interesting specific cases are: v = Q, linear retarder; jz = ±|, /3 = |, 
circular retarder. 

The polar decomposition of a Jones matrix shows that the polarization 
effects of any “pure” scatterer are equivalent to that given by a system com- 
posed of a serial combination of an elliptical diattenuator and an elliptical 
retarder in either of the two possible relative positions. These results can be 
applied to the Mueller representation, so that a pure Mueller matrix N can 
be written as N = NpNp, or N = N^^Np, where Np, Np are real symmetric 
matrices and Np, Np are orthogonal matrices. 

Although the equivalent system is not unique, the physical parameters 
(a, S, k\, k 2 , A), {a', 6', k'i,k'2,f3', 7', A') corresponding to the components 

of the respective “equivalent systems” provide all information obtainable by 
polarimetric techniques, and are useful in order to study properties of the 
scattering system. 

The four parameters of the “equivalent diattenuator” are obtained through 
the following expressions [17] 





ki = 


k[ = 


0 

0 


+ q) ) 








k2 = 


k'2 = 


(noo 


-q) . 




(23) 


sin 2a = 


(rio2 




1 


cos 2a = 


_ noi 


(24) 






q 






q 


sin 2a' = 




+ nlo 


1 


cos 2a' = 


_ nw 
~ •) 


(25) 



, ZjW , 

q q 
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tan 




nso , 
n2o ’ 




(26) 



where 



2 2 I 2 I 2 2 I 2 I 2 

q = tiqi + riQ 2 + ~ ''^lo + ^20 + ^ 30 ■ 

To obtain the parameters of the “equivalent retarder” it is necessary to 
distinguish two cases depending on the value of det N. 

a) det N 0 

Since detA'^A = 1, then detA'p = det iVp = det yf 0. It can thus be 
proved that the inverse matrix of A^p is 

[iVp(A:i,fe,a,<5)]-' = iVp • (28) 

Although this is not a physical Mueller matrix (because ^ > 1, ^ > 1), 
the Mueller matrix of the equivalent retarders can be obtained as 

= = . ( 29 ) 

Once Nn and A^p (whose elements are denoted by , rb respectively) are 
calculated, the parameters {A, p, tp) , {A', v', ip')of the equivalent retarders are 
given by [17] 

cos^ (^) = jTrA^«’ ( y) = (80) 



sin 2iy = 



ri2 - V21 



sin 2iy = 



/ '12 '21 



2sin(f)’ 2sin(f)’ 



(31) 



sin 2pj = 



rsi - ri3 



sin 2ip' = 



'31 '13 



2 cos 2v' sin A' ’ 



2 cos 2v sin A 

which leads to 

cos 2/3 = cos 2v cos 2ip, cos 2/3' = cos 2 p' cos 2\p' , 



(32) 



(33) 



tan 7 



tan 2u 
sin 2ip ’ 



tan 7 ' 



tan 2 p' 
sin 2%p' 



(34) 



b) det N = 0 

In this case /c 2 = 0 and A^p, A^p are not unique. An arbitrary value can be 
chosen for 7 , for example 7 = 0 . Thus, the Mueller matrix N can be expressed 
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in terms of equivalent systems composed of a total polarizer and a linear re- 
tarder N = Np (a, S, ki,0) Np 0, Z\) ; N = N'^ (/3', 0, A') Np (o', 6, ki,0). 
Now /?, f}' are given by 



tan 2/3 = tan 2/3' = 



nio — noi 



^02 — ^20 

and A {A') have two possible solutions Ai,A 2 {A[,A' 2 ) 

bnso - ano3 



tan Ai = tan Z\ ^ = 



62 - ngg 



(35) 



(36) 



■ /I . f A/\ ^30 + ano3 

tanZl 2 = tan (tt - Zlo) = — , — , (37) 

62 - n23 

where 

a = noi sin /3 — rio 2 cos /3, 6 = Uiq sin j3 — n 2 o cos (3. (38) 

In the case of a = nos, these results must be replaced by Z\ = Z\' = 

The obtainment of the physical parameters of the equivalent system corre- 
sponding to experimentally measured pure Mueller scattering matrices, pro- 
vides an appropriate way of inspecting the polarimetric properties of the 
sample, avoiding potentially erroneous analysis due to the complicated im- 
plicit relationships between the elements of the Mueller matrix. Thus, when 
the experimental conditions of scattering ensure the coherent superposition 
of the emerging light beams, the Mueller matrix of the system is a pure Mu- 
eller matrix and this procedure can be applied. Moreover, several equivalence 
theorems, such as those summarized below, could be useful in the analysis of 
pure Mueller matrices[14][19]. 

• “Any serial combination of retarders is optically equivalent to a retarder 
(in general elliptical)” . 

• “Any serial combination of retarders is optically equivalent to a serial 
combination of two linear retarders (the solution is not unique)”. 

• “Any serial combination of retarders is optically equivalent to a serial 
combination of a linear retarder and a rotator” . 

• “Any elliptical diattenuator is optically equivalent to a linear diattenua- 
tor placed between two identical linear retarders with orthogonal bire- 
fringence axis”. 



4 Mueller Matrices for Incoherent Scattering 

Depending on the characteristics of the scattering system as well as the cohe- 
rence of the light, for certain directions scattered light could be a superposi- 
tion of several incoherent wavelets. In this case the optical system can not be 
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represented by means of a Jones matrix nor a pure Mueller matrix, but can be 
considered as an ensemble [20-22] composed of deterministic-nondepolarizing 
(pure) elements, each one with a well-defined Jones matrix, in such a manner 
that the incident light beam is shared among these different elements (or 
particles). Let li be the intensity of the portion of light that incides on the 
“i” particle. The ratio between li and the intensity I of the whole beam is 
denoted by a respective coefficient pi so that 

Pi = y, 

i 

Now we denote by 14(0 and the matrices representing the “i” 
element in the different formalisms (i. e.: Jones matrix, complex Mueller 
matrix and Mueller matrix). Thus, for a given scattering direction, the Jones 
vector of the emerging wavelet is 

£'(t) = t(*) (v/^£(t)) , (40) 



and the corresponding polarization vector is given by 

= (£' 0 e't) = ® (r(*)v^e(t))*) 

= (g) (e (g) £*)) = (41) 



The polarization state of the complete emerging beam (for the scattering 
direction in question) is given by the incoherent superposition of the beams 
emerging from each element. Then, the polarization vector of the emerging 
beam is 



ip' = Lp; L = 




(42) 



showing that the corresponding Mueller matrix M is given by 



M = a( J A~^ = J2p^ (43) 

\ i / i i 

We have obtained this result by considering the optical system as com- 
posed of a set of parallel pure elements, but it is worth pointing out that the 
same result could be obtained by considering the system as an ensemble [23] 
so that each realization “i”, characterized by a well-defined Jones matrix 
occurs with a probability Pi-We can therefore consider the optical system as 
composed of such an ensemble, and will refer to ensemble averages by me- 
ans of brackets (x) = so that {N) = or, in components 
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5 Parallel Decomposition of Mueller Matrices 



By analyzing the expressions of the elements of the matrix L obtained in the 
previous section 



/ {toQ {toil) {hQ 
(tot*) {tot* 3 ) {ht*2) (titt) 
{t 2 tt) iht*o) ihtl) 
\{t2t*2) {t3t*2) {UDJ 



(44) 



we can clearly see that its elements can be suitably reordered to construct a 
new matrix H whose elements are defined by 

hu = \{tkti); k, I = 1,2, (45) 



This structure corresponds to a positive-semidefinite Hermitian matrix 
[24], consequently characterized by four non-negative eigenvalues. The expli- 
cit relation between M and H is given by the following expressions 



/ moo + rnoi 

+Wio + mii 



mo2 + mi2 

+i (W03 + TOls) 



17120 + m 21 

-i {moo + W31) 



m22 + 17133 \ 

+i {17123 - m 32 ) 




mo2 + 17112 moo - moi 

-i (too 3 + mi3) -l-mio - mn 



m22 - m33 

-i (m 23 + ms2) 



m2o - m2i 

-i {17130 - m 3 i) 



W 20 + m2i m22 - ms3 

-l-i (m3o -I- m3i) -ft (m 23 + m 32 ) 



moo + moi 

-mio - mil 



mo2 - mi2 

+* {mo3 - mi3) 



17122 + TO33 m2o - m2i 

\-i{m23-m32) +t(m3o-m3i) 



TO02 - mi2 
-i (mo 3 - mi 3 ) 



moo - moi 
-mio-fmii / 

(46) 



M = 



0 

0 

+ 


1 

0 

0 


-f/l 22 + ^33 


-f tl 22 ~ ho 3 


hoo + ^11 


1 

0 

0 


— tl 22 — ^33 


CO 

CO 

+ 

1 


ho2 + h20 


ho2 + h20 


+ hi 3 + hoi 


—hi 3 — hoi 


t {ho2 — tl 2 o) 


t {ho2 — h2o) 


\ -ft {hi 3 — hoi) 


—i {hio — hoi) 



hoi 


+ 


hio 


—i {hoi - 
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Let us note that H determines the Mueller matrix uniquely and vice-versa, 
and that the relation can also be written as 

1 ^ 

^ X! "^kiEkh (48) 

0 

where Ej^i {k, Z = 0, 1, 2, 3) denotes the set of 16 Hermitian and unitary Dirac 
matrices 



Eki = (Tk®<Ji, (49) 

which constitutes a complete set for expanding a Hermitian operator into 
a linear combination with real coefficients. These coefficients are 16 directly 
measurable magnitudes, namely the 16 elements of the Mueller matrix M 
associated with El. The Dirac matrices (as well as the Pauli matrices) are 
trace orthogonal, in the sense that the trace of the product of any two of 
these operators is equal to zero, and constitute a complete set of base matri- 
ces. This expansion provides a fundamental significance for the elements of 
the Mueller matrix, in addition to their phenomenological significance. The 
relation between the matrix H and its corresponding Mueller matrix is, in 
fact, analogous to that between the polarization matrix and its corresponding 
Stokes parameters. It is possible to classify the Mueller matrices according 
to the rank of H , i. e. according to the number of non-zero eigenvalues. For 
example, H corresponds to a pure Mueller matrix when only one eigenvalue 
is non-zero. Since H is Hermitian, it can be diagonalized through a similarity 
transformation, given by a unitary matrix U, such as 

H = UAU~^, (50) 

where A is the eigenvalue diagonal matrix D (Aq, Ai, A 2 , A 3 ), with Aq > Ai > 
A 2 > A 3 . 

Then, we can write H as 

H = XoUD{l, 0, 0, 0)C/"^ -k AiC/D(0, 1, 0, 0)C/"^ 

+X2UD{0, 0, 1, 0)D-^ -k XsUDiO, 0, 0, l)U~\ (51) 

which is a linear combination, with non-negative coefficients, of four ma- 
trices, each with only one eigenvalue equal to 1. By carrying this result to 
the Mueller formalism, we get 



M — XqNq -k AiA^i -k X 2 N 2 -k X 3 N 3 . (52) 

Thus, we see that any Mueller matrix can be decomposed into one to four 
pure Mueller matrices weighted by the eigenvalues of H. This decomposition 
provides a physical interpretation analogous to the decomposition of a pola- 
rization matrix into a superposition of two orthogonal-polarized, incoherent. 
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linearly independent beams whose intensities are the two eigenvalues of the 
polarization matrix[25]. 

We see that the system can be considered a parallel combination of one 
to four pure elements and that an appropriate procedure of analysis of an 
experimentally obtained Mueller matrix is the following 

• From M, calculate H , 

• Calculate the eigenvalues of H and the eigenvectors matrix U, 

• Calculate the pure components Ni of the “parallel decomposition” of M, 

• Apply the polar decomposition theorem for each component Ni and ob- 
tain the parameters of the equivalent systems, 

• Interpret the characteristics of the scattering system in terms of these 
results. 

It should be noted that the equivalent parameters of the four parallel 
components are non independent. In fact, the total 16 free parameters are 
shared among the four parallel components in such a manner that Nq, Ni, 
N 2 and N 3 , depend on 7,5,3 and 1 independent parameters respectively. 

A peculiar characteristic of the above parallel decomposition is that the 
weights are the eigenvalues of H. Nevertheless there are infinite possible de- 
compositions where the weights are different from Ai. 



6 Mueller Matrices and Stokes Matrices 

In this section we refer to “Stokes matrices” for the set of matrices B cha- 
racterized by the fact that, for any Stokes vector s, the transformed vector 
Bs is also a Stokes vector. The implicit or explicit identification between the 
set of Stokes matrices with the set of Mueller matrices is the starting point 
of several works dealing with the properties of Mueller matrices. The general 
characterization of Mueller matrices through the non-negativity of the eigen- 
values of H enables us to analyze whether both sets are in fact identical. First 
of all, we can observe that, obviously, any Mueller matrix is a Stokes matrix. 
Nevertheless the converse is not true, i. e., there exist Stokes matrices that 
are not Mueller matrices. As a counterexample, let us consider the Stokes 
matrix 7A(1, 0,1,1) whose corresponding H matrix, calculated with Eq. 46, 
has negative determinant. We find that some properties arise from the pecu- 
liar structure of Mueller matrices and not exclusively from the fact that they 
transform Stokes vectors into Stokes vectors. 

7 Transmittance Condition 



As we have seen, the non-negativity of the eigenvalues of H matrices leads 
to restrictive inequalities between the elements of Mueller matrices. Here we 
will consider the restrictions derived from considering passive systems (such 
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as scattering systems) whose maximum intensity transmittance tmax is less 
or equal to 1. 

For a given Mueller matrix M, and an incident light beam with a Stokes 
vector s, the intensity transmittance “t” is given by the ratio of the exiting 
intensity Sq to the incident intensity sq 

^ _ Sq _ mopSQ + mpisi + mo 2 S 2 + W03S3 
So Sp 

The intensity transmittance averaged over all incident polarization states 
is equal to mpp, which is also the transmittance for unpolarized light (si = 



S 2 = S 3 = 0). The maximum and minimum of t 

1 . 

t+ = moo + (wpi + TOp2 + mpp) ^ , (54) 

1 

t- = mpp - (mpi + mp 2 + mpp) " , (55) 

are reached for the following respective incident Stokes vectors [26] 

/ 1 s T 

s+ = (^(mpi + mp 2 + mpp) " ,mpi,mp 2 ,mppj , (56) 

/ 1 s T 

s_ = (^(moi + mo 2 + mop)%-moi,-mp 2 ,-mp 3 j . (57) 

Thus, the “Transmittance Condition” can be expressed as 

t+ = mpp + (mpp + mp 2 + mpp) ^ < 1. (58) 

On the other hand, the condition > 0 does not lead to any additional 
restriction because any Mueller matrix satisfy the following inequality 

(m^i + m^2 + " < ™oo- (59) 



8 Purity Criterion and Purity Index 

As we have seen, incoherent superposition of the wavelets scattered in a 
given direction results in depolarization, and an equivalent system constituted 
by pure components requires parallel combination. Once a Mueller matrix 
is measured, it is very useful to apply a simple criterion in order to check 
(within the experimental limits of accuracy) if the matrix corresponds to a 
pure system or to a parallel incoherent combination. This “Purity Criterion” 
can be stated as the following theorem: “A necessary and sufficient condition 
for a Mueller matrix M to be a pure Mueller matrix is that M satisfies the 
equality Tr(M^M) = 4mpp ” [27]. 
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In general, any Mueller matrix satisfies Tr(M^M) < 4 ?tiqq, and the equa- 
lity is reached for pure Mueller matrices [20,27,28]. 

The terms of this condition can be expressed in terms of the Euclidean 
norm of M [27], 



W\l = ( E < I = [Tr (M^M)] ^ , ( 60 ) 

and in terms of moo, which coincides with the norm defined by the maximum 
of the elements of M, (it can be proved that moo > 0, woo > rriij), 



II^IL = "^00- 



(61) 



Then, the general condition results in the “Condition of the Norms” 

I|M||,<2||M|L. (62) 

These results have similar well-known conditions for Stokes vectors. In 
fact, the elements of Stokes vectors verify the condition s\ + S 2 + s\ < Sg, 
which can be expressed as 

INIe<V2|l«IL- (63) 



As with the degree of polarization of light beams, it is possible to define 
a non-dimensional parameter for measuring the “Degree of Purity” P (M) of 
an optical system represented by a Mueller matrix M[29] 



P{M) = 



Tr - mgo 

3m^o 



1 / 4Tr(g^) \1^ 

3 {Tr Hf ) 



(64) 



The value of P (M) is restricted by the following limits 



0 < P{M) < 1. 



(65) 



P (M) = 0 corresponds to a total depolarizer whose Mueller matrix ele- 
ments are zero except moo. 

P (M) = 1 corresponds to a pure Mueller matrix. 

Other kinds of parameters are related to the diattenuation effects. The 
“Polarizance” K, defined as the degree of polarization of the emerging light 
when unpolarized light is incident 



K = 



(woi 



+ ml2 







moo 



(66) 
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provides a measure of the polarizing power of the optical system. The “Re- 
verse Polarizance” [29] is defined in the same way 

AV s W. + "»lo + 

moo 

referring to the same system when the incident and emerging light beams are 
interchanged. 

The limits for the values of K and Kr are 

0 < R: < 1; 0<Kr<l. (68) 

In the case of pure Mueller matrices Kd = Kr, but in general their values 
can be different. 
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Abstract. Light scattered from particles near a plane interface have important im- 
plications in many branches of science. Applications range from radar detection and 
remote sensing to clean-room monitoring and quality control in the manufacture of 
silicon wafers. The most fundamental system of this type is that of a sphere near 
a plane interface. Calculating the scattering is complicated because the boundary 
conditions at the sphere and plane surfaces must be satisfied simultaneously, and 
these two systems represent two fundamentally different geometries. In this chapter 
we present an analytical solution to this problem that retains the physics and may 
be used as a basis for more numerically intensive techniques necessary when the 
system constituents become irregular. 



1 Scattering System 

Physically, we can consider four field components that interact with the sy- 
stem of a sphere near a plane interface shown in Fig. 1. For one component, 
the incident light strikes the sphere and is scattered into the far-held re- 
gion beneath the plane interface. For a second component, the incident light 
rehects off the plane interface before it is scattered by the sphere into the 
far-held region beneath the plane interface. For a third component, the inci- 
dent light strikes the sphere and is scattered toward the plane interface which 
rehects this light toward the far-held region beneath the plane interface. And 
a fourth component, the incident light rehects oh the plane interface before it 
is scattered by the sphere toward the plane interface which rehects this light 
toward the far-held region beneath the plane interface. The total scattered 
held can be found as the superposition of these four components, since it is 
widely known how a plane wave interacts with isolated spheres and plane 
interfaces. However, this “double interaction” model does not include the in- 
teraction helds, i.e., helds that are scattered by the sphere, rehected oh the 
plane interface and are once again incident upon the sphere. To include the 
ehects of this interaction, we must use more complicated models. 

In this chapter we discuss a multipole solution, based on image theory, to 
the problem of a sphere in close proximity to a plane interface. This type of 
solution is valuable, since it retains the individual held components interac- 
ting with the individual system components that many numerical techniques 
cannot. In this way, the physics is preserved. The incident, scattered, and 
interaction helds are expanded in terms of vector spherical harmonics ab- 
out coordinate systems centered on the sphere and its image. The boundary 



F. Moreno and F. Gonzalez (Eds.): Lectures 1998, LNP 534, pp. 81-96, 2000. 
© Springer-Verlag Berlin Fleidelberg 2000 




82 



G. Videen 




Fig. 1. Diagram of the scattering system. A sphere of radius Tsph, complex refractive 
index m-sph, lies a distance d beneath a plane interface separating a medium of 
refractive index mi below the interface {zi < d) from a region of complex refractive 
index m 2 above the interface {zi > d) . A plane wave, whose wave vector lies in the 
xi — zi plane and is oriented at angle 9o with respect to the zi axis, is incident upon 
the system. To help satisfy the boundary conditions, an image coordinate system 
{x 2 , 1/2, Z 2 ) is introduced a distance 2d above the coordinate system centered on the 
sphere 



conditions on each subsystem can then be satisfied individually. Because the 
field expansions are in terms of vector spherical harmonics, it is relatively 
easy to satisfy the boundary conditions on the spherical interface. To satisfy 
the boundary conditions at the plane interface, an image (or interaction) 
field is introduced that is physically equivalent to the scattered field from the 
sphere that reflects off the plane interface and is incident upon the sphere. 
For a perfectly conducting plane interface, this interaction field is the image 
of the scattered field from the sphere. When the half-space not encompassing 




5 Sphere Near a Plane Interface 



83 



the particle is dielectric or lossy, some approximation or numerical technique 
must be used to represent this interaction field. 



2 Field Expansions 

Figure 1 shows the scattering system. The isotropic, homogeneous sphere 
of refractive index rusph and radius Tgph is located in air, centered on the 
{xi,yi,zi) coordinate system located a distance d beneath a plane interface 
separating region 2 of complex refractive index m 2 (zi > d) from region 1 
of real refractive index mi (zi < d). We denote fci, k 2 , and kgph to be the 
propagation constants of waves in region 1, region 2, and within the sphere. 
To satisfy the boundary conditions at the plane interface, we introduce an 
image coordinate system {x 2 ,y 2 , Z 2 ), located a distance d above the plane 
interface on the zi axis. This coordinate system is the source of the image 
or interaction fields. Incident upon the scattering system is a plane wave of 
wavelength A whose wavevector ko is oriented in the x\ — z\ plane at angle 
9o with respect to the Z\ axis. The electromagnetic fields may be expanded 
in terms of the vector spherical harmonics centered about the j coordinate 
system (j = 1,2): 



{kv,) = 9, [zlf\kr^ (cos 9j) exp 

~(j)j \z^X^ (krj)fmn (cos 9j) exp (imcjjj) 



( 1 ) 



(kv,) = 



—z)^\krj)n{n + l)P™(cos0j) exp {im<j>j) 



+9j 



y(p)( 
’’n V 

1 d 

kvj dvj 



Tjzh'^krj) I fmn (cos 9j) exp {im(j>j) 



(2) 



rjZ^^\krj) 7r„„ (cos0j)exp(im</)j) 

where z^\krj) are the spherical Bessel functions of the first, second, third, 
or fourth kind {p = 1, 2, 3, 4), and 



7fm„ (cos 9) = P™(cos 9) 
sm 9 



fmn {cos 9) = — P™(COS0) 
d9 



( 3 ) 

( 4 ) 



P™(COS0) 



(2n + l)(n — m)l 
2(n + m)! 



P„™(COS0) 



( 5 ) 
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where P™(cos 6) are the associated Legendre polynomials. We assume a time 
dependence of exp(— iwt) throughout. 

To satisfy the boundary conditions, we take advantage of the linearity 
property of the electromagnetic fields and consider the components of the 
electromagnetic field individually. These components are each expressed as 
a multipole expansion of vector spherical harmonics, and the boundary con- 
ditions at the interfaces of the sphere and of the plane are satisfied simulta- 
neously. We will first consider the incident fields. 



3 Incident Field 



The incident electric field that strikes the sphere is expanded in vector sphe- 
rical harmonics about the sphere coordinate system as 

oo n 

Einc = XI XI {^ 1 ^ 1 ) ( 6 ) 

n—1 m——n 



where Omn are the incident field coefficients. This component of the electric 
field is made up of two parts: 1) the plane wave which strikes the sphere 
directly; and 2) the plane wave that strikes the sphere after it is reflected 
off the plane interface. We consider these subcomponents individually, and 
because we are dealing with a linear system, the incident held coefficients are 
each composed of two elements: 



(i) 

'mn 



(j)° 

'mn 






(7) 



where are the coefficients of the incident plane wave that strike the 
sphere directly, and are the coefficients of the incident plane wave 

that were reflected off the plane substrate. To be completely general, we 
consider two different polarization states of the incident held: 1) A unit- 
normalized TE-incident (transverse electric) plane wave of the form = 
exp ik\{zi cos 9o + x\ sin 9o)yi ; and 2) A unit-normalized TM-incident (trans- 
verse magnetic) plane wave of the form = exp ifci {z\ cos 9o + x\ sin 9o) x 
(xi cos 9o — Zi sin 9o). We denote the coefficients for these two cases with an 
additional superscript: Umn and We can And these coefficients 

using the orthogonality properties of the vector spherical wave functions. The 
first coefficient can be expressed as 



a 



(l)o,TE 

mn 



J ETE-M^^*(fciri)dE 
/M^Ufciri)-MW*(fc^n)dE 



(8) 



where the asterisk denotes the complex conjugate and the integral is taken 
over all space. With a little bit of algebra and calculus, we And 



(l)o,TE 



-2i" 
n (n -I- 1) 



Ann (cos ) . 



(9) 
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Similarly, the coefficients 



,(2)o,TE _ 






-2i^ 



/N^ (fcin)-NW*(fc^ri)dE n(n+l) 

It is a simple exercise of Maxwell’s equations to show that 



^mn (cos6»o). (10) 



„(l)o,TM ^ • (2)o.TE 
^mn 



( 11 ) 



and 



(2)0, TM 
^mn 



\a 



(l)o,TE 

mn 



(12) 



Finally, we must determine the reflected components of the incident held 
coefficients. The solution to the scatter of a plane wave by an infinite plane 
interface is a classic problem of electromagnetism solved by Fresnel in the 
19th century. The amplitude and phase of the reflected plane wave are de- 
pendent on its polarization state, incident angle, and the refractive index of 
the media at either side of the interface. The Fresnel reflection coefficients 
for a plane wave incident at angle a on the interface, traveling from medium 
i and reflecting off the interface at medium j are 



i?™(a) = 



rrii cos a — rrij — (jrii / rrij)'^ sin^ a 



nii cos a + rrij \/ 1 — {rrii jmjf' sin^ a 



(13) 



i?™(a) = - 



rrij cos a — rrii y^l — (mijmjf' sin^ a 
nij cos a + rrii ~ sin^ a 



(14) 



where the superscipts on the reflection coefficients denote the polarization 
state of the incident held. It should be noted that these particular sets of 
equations are valid when the reflected electric field component is pointed in 
the same direction as the incident electric held component for both TE and 
TM polarization states when a = 0. From the relation 



we And 



p„“(-^) = (-i)”^™^^r(^) 


(15) 


(-a;) = (-1)”+™ 7r™„ (x) 


(16) 


Tmn {-X) = (_l)”+™+l (a;) . 


(17) 
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Using these expressions, we can write the coefficients of the reflected field in 
terms of the coefficients of the direct held: 



(-i)-?+”+™ i?TE (2ifcidcos 6»o) (18) 

for the TE-incident plane wave, and 

’™ (-1)^+”+™ i?™ (6»o) exp (2\kidcos6o) (19) 



for the TM-incident plane wave. The total incident held coefficients can be 
written as 



„(i)TE ^ (i)o,TE 
^mn ^mn 



1 + ^te g^p (2ifcidcos 6o) 



(20) 



for the TE-incident plane wave, and 



„(i)TM ^ (i)o.TM 
'^mn ^mn 



1 -I- (—1)1+”+’” i?™ (0o) exp (2i/cidcos0o) 



( 21 ) 



for the TM-incident plane wave. Equations (20) and (21) are the vector sphe- 
rical harmonic expansion coefficients for the total held of a plane wave reflec- 
ting off a plane interface, and are nothing more than Fresnel theory expressed 
in a spherical coordinate system. The phase term exp (2ifcidcos0o) exists in 
these expressions because the coordinate system from which the waves are 
expanded is located a distance d beneath the plane interface. 



4 Fields at the Spherical Interface 

The next step in the derivation is to satisfy the boundary conditions at the 
spherical interface. All the components of the electromagnetic held must be 
expanded in vector spherical harmonics as we did in the previous section for 
the incident held. The scattered held component is expanded as 

oo n 

Esca = ^ ^ M(^), (ftiri) + CMU^Ui) . (22) 

n=l m——n 

(s') 

For the chosen time dependence, the spherical Hankel functions (fciri) = 
hn'^ (kiri) represent outgoing spherical waves and satisfy the boundary con- 
ditions at infinity. Because there is a pole at the origin, this expansion can 
only be used in the region exterior to the spherical interface. 

The electric fields inside the sphere volume (ri < rgph) are expanded in 
vector spherical harmonics about the sphere coordinate system as 

OO n 

Esph = ^ ^ (^sphri) + (fcsphri) 

n—1 m——n 



(23) 
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( 1 ) 

where Cmn are the internal field coefficients. The radial dependence of the 
vector spherical harmonic expansion must be in terms of Bessel functions 

(^sph^l) — jn (^sph^l) • 

Finally, an image or interaction field must be introduced to help satisfy 
the boundary conditions at the plane interface. This field is expanded in 
a series of outgoing spherical waves centered on the {x2,y2,Z2) coordinate 
system: 



E™a = E E + (24) 

n—1 m——n 

Physically, this field corresponds to the interaction of the sphere with the 
plane interface: the scattered field from the sphere reflects off the plane inter- 
face and interacts with the sphere again. To satisfy the boundary conditions 
on the spherical interface, it is convenient to expand this image field on the 
(xi,yi,zi) coordinate system. Such a transformation of coordinate systems 
can be performed, since the vector spherical harmonics represent a complete 
orthogonal expansion. Details of this expansion and satisfying the boundary 
conditions on the plane interface are discussed in the next sections. Because 
there is no pole in the interaction field when ri = 0, the radial dependence of 
the vector spherical harmonic expansion must be in terms of Bessel functions 
Zn '’ {kin) = j„ (kiTi) : 

oo n 

Ei^a = E E e«MW + (fcin). (25) 

n—1 m——n 

Satisfying the boundary conditions at the spherical interface is a relatively 
straightforward process. This has been done well in so many texts that it 
would be tedious to repeat the process. Since many applications call for scat- 
tering from nonspherical particles, it is worthwhile to outline a more general 
formulation of this scattering solution. The solution when the particle is sphe- 
rical will then be provided. We express the fields incident upon and scattered 
by the particle in vector spherical harmonics. Any field incident upon a three- 
dimensional particle 

Ei„c = pW MW (ftin) + (fciri) (26) 

has a corresponding scattered field, which can be expanded in terms of vector 
spherical harmonics 



OO n 



E = , , -I- , ,1 Ifeiril 

-■-'sea / ^ ^ \ymn-^ mnm'n' ' i^mn-^mnm'n' 

n' — l 

-I L(1) 7^21 I (2) 2^22 ,1 ,{k^v^^■ 

' \ymn mnm'n' ' rmn mnm'n' m'n' 5 



( 27 ) 
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(7) 

that is, the scattered field coefficients hmii are related to the incident field 
coefficients via some transition matrix (or T-matrix) 

00 n 

rjW ('28') 

^mn / V / V ^mn-^ mnm'n' ' ^mn-^ mnm'n' ’ 

n' — l m'=- — n' 

In a multiple system, like that of a sphere and a plane interface, the total 
field incident upon the particle is composed of the incident plane wave and 
the interaction field; and the scattered field coefficients are expressed as 

00 n' 00 n 

I’m = E E E E 

n' — l m' ——n' n —1 m— — 

(29) 

This equation expresses the scattering coefficients in terms of the T-matrix 
of the isolated particle. This is convenient, since T-matrix routines are rea- 
dily available for many particle systems. For certain types of particles, the 
T-matrix can be simplified. The T-matrix of axially symmetric particles 
obey ^or radially symmetric particles, = 

^nn' ^mm' »nd there is no mode mixing, i.e., = T^nm'n' = 0- 

Spheres are a subgroup of this latter particle type, and the T-matrix is simply 
composed of the Mie scattering coefficients fl : 

('‘tn'l 

mnmn Jn 

^sphV’n (^sph'^sph) (^I'^sph) (^l^sph) V’n (^sph^sph) 

^sph'^n (^sph'^sph) (^I'^sph) (^l^sph) (^sph^sph) 



.(1) 



.(1) 






mnm'n' 



.(2) 



.(2) 



T: 






mnm'n' 



rp 22 _ £2 /oi^ 

mnmn Jn \^-^J 

(^sph'^sph) '4^n (^l^sph) ^sph^n (^l^sph) (^sph^sph) 

(^sph'^sph) ^n (^l^sph) ^sph'0n (^l^sph) (^sph^sph) 

where (x) = xj„ (x) and (x) = xhn '^ (x) are the Riccati-Bessel functions 
and the primes denote derivatives with respect to the argument. The other 
terms of the T-matrix are zero. 



5 Translation Addition Theorem 

The translation addition theorem is used to translate a set of vector spherical 
harmonics from one coordinate system onto another coordinate system. We 
need to find a relationship between the two sets of interaction coefficients 
given by (24) and (25). The fields expressed by these equations are identical, 
but they are expanded about different coordinate systems. For a translation 
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along the negative axis a distance 2d with no rotation, the vector spherical 
harmonics are related by 

OO 

{k,r,) = ^ , (fcin) + , (fcin) (32) 

n' — l 



N® (hr,) = ' (kiri) • 



This relationship is valid in the region where ri < \2d\ , i.e., out to the 
pole located at r 2 = 0. We can use these equations to find the relationship 
between the dmh and Cmn coefficients. Substituting (32) and (33) into (24) 
and comparing with (25) yields 




(35) 

— 1 

and for a spherical particle, (29) can now be expressed as 

OO n OO ^ 

e=i: i: /i 

n—lm——n \ n'—O ) 

+ [ati+f^ I fl (36) 

I n'=0 J 

It is beyond the scope of this chapter to provide a complete derivation of 
the vector translation coefficients and however, it is certainly 

worthwhile to write out a set of relations from which these may be calcu- 
lated. The vector translation coefficients can be expressed in terms of the 
scalar translation coefficients that are used to translate the spherical 

harmonics from one coordinate system onto another coordinate system: 

OO 

h^^\kr 2 )Pn(cose 2 ) exp (im^ 2 ) = E 6>i) exp (im</)i) 

n^— 0 

(37) 




The scalar translation coefficients can be found via various recursion relations. 
The starting point for calculating the scalar coefficients is 




(38) 
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which results from the expression 



hi^\kir 2 ) = ^ (2n' + 1) 



From here, the relation 



^(o,n+i) _ 1 / 2n + 3 J , / 2n + 1 ^(o,n) 

“ (n+l)V2n'+l P V 2n'-l^”'-i 



/2n' + 1 
2n- 1 






/2n+ 1 
2n' + 3 



can be used to calculate the elements The remaining terms in the 

matrix can be calculated using the relation 

^ / (n' - m + l)(n' + m)(2n' + 1) (m-i,n) 

{n-m+l){n + m){2n' + 1) ^ ^ 



—2kid\ 



' {n' — m + 2){n' — m + 1) 

{n — m + l){n + m){2n' + l)(2n' + 3) 



-y(m— l,n) 



_ 2 ^ j / (n' + m)(n' + m- 1) 

^ y (n — m + l)(n + m)(2n' + l)(2n' — 1) 



It should be noted that 






We can find the vector translation coefficients using 

2k\d I in' — 771 + l){n' + m + 1 ) „(m,n) 
^ “PTTy (2n' + l)(2n' + 3) ”'+i 

2kid I {n' - m){n' + m) (rn,n) 

n' V (2n' + l)(2n'- 1) "'-1 ’ 






2ikiind ^(ra^n) 
n'{n' + 1) 



Finally, from (42)-(44), the following relations are valid: 






( 46 ) 
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6 Plane Interface 



The boundary conditions at the plane interface are the most difficult to satisfy 
because all the field expansions are in terms of vector spherical harmonics. 
In general terms we can introduce another transition matrix , which, 

for our purposes transforms any set of expanding spherical waves centered 
on coordinate system {x\,yi,Zi) incident on the plane interface 

Eflr = (fcin) + gW N® (fcin) (47) 

to its scattering response, a set of expanding spherical waves centered on 
coordinate system (x 2 ,y 2 ,Z 2 ) that are reflected by the plane interface: 



oo n 



n' — l m'——n' 

+ , (fciro) ■ 

' \^mn^ mnm'n' ' ^mn^mnm'n' m'n' i 



(48) 



that is, any held incident on the plane interface expressed by (47) is scattered 
by a held expressed by (48). For a plane interface, all elements = 

because of the system symmetry. 

We first treat the case of a perfectly conducting plane interface. At the 
interface (c 2 = ri, O 2 = n — Oi, 4>2 = ^i), the tangential components of 
the electric and magnetic fields must be continuous. We can achieve this by 
making the scattered held the mirror image of the incident held. Using (1), 
(2), (16), and (17), it can easily be shown that the boundary conditions are 
satisfied if the scattered held is of the form 



= Xu (-1)”+™+' Mi^l {k,V 2 ) + Xu (- 1 )"^™ (fcir-2) (49) 



or 



Xnur'W = Sun'Srum'S,, (-l)"+-+^‘ . (50) 

This is equivalent to the following relationship between the scattering and 
interaction coefficients 







and (36) can now be written as 



(51) 



OO n 



Xu^Y. E \Xn+YXT'^^[Xn'AX-'^-bXBX^''> 

n—1 m——n \ n' — l 

+ E IXu'BX^''' - Xu'AXY ] fl 



n' — l 



fn 

(52) 
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Equation (52) represents the final solution to the scattering problem of a 

(i) 

sphere near a perfectly conducting interface. The scattering coefficients bmn 
are written in terms of a set of known quantities: translation coefficients 

^ and \ incident field coefficients Umn, and Mie coefficients /^. 

By truncating the infinite series, we can find a solution through matrix in- 
version techniques. The number of terms necessary for convergence depends 
on the required accuracy. We can obtain adequate results by carrying the 
summation in n to 

N = x,ph + 4x1(1 + 2 (53) 

terms, where Xgph is the size parameter of the sphere. Note that this formula 
is independent of the separation distance d. 

Many real-world interfaces are not perfectly conducting. To solve this 
more general problem, we must either make some approximation or resort to 
numerical techniques. The simplest approximation one can make is to assume 
that for the purposes of the interaction only, the image fields are proportional 
to the scattered fields; i.e., 

ULn^'n' = (-1)"+'"+^' i?™ (0) . (54) 

The factor of proportionality chosen is the Fresnel coefficient at normal in- 
cidence. This means that the scattered fields from the sphere that reflect off 
the plane interface and strike the sphere again, strike the plane interface at 
normal incidence. Geometrically, there is some justification to this argument: 
rays emanating from the center of the sphere reflecting off the plane interface 
and striking the sphere can have a maximum incident angle on the plane 
interface of 30° , and the Fresnel reflection coefficients remain nearly constant 
at near-normal incident angles. Equation (54) is equivalent to the following 
relationship between the scattering and interaction coefficients 

d'ii = - (-1)”+’”+^' i?™ (0) (55) 

and (36) can now be written as 



‘■'di = EE 



{ + 1 ; (- 1 )"'+“ ( 0 ) 4 ”’"'’ 

I n '^1 

( oo 

+ + E (-1)"'+’” RJ 2 (0) 



n' — l 



(56) 

/n 

fu 



An exact treatment of the scattering from a sphere located near an ar- 
bitrary halfspace requires considerably more space than I am allowed in this 
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chapter; however, we can find one using the above derivation. The only re- 
quirement is finding the matrix elements . One way to do this is by 

expanding the individual vector spherical harmonics given by (47) as an angu- 
lar spectrum of plane waves. The reflected field is also expressed as an angular 
spectrum of plane waves and the angle-dependent amplitudes of the reflected 
field are equal to those of the incident field, mulitplied by the appropriate 
Fresnel reflection coefficient. Finding the matrix elements requires 

expressing the reflected angular spectrum in terms of the vector spherical 
harmonics. This final step is a minor variation on (8) and (10). Note that 
with little modification, the equations of this chapter can also be modified 
to accommodate irregularly shaped particles and roughness on the “plane” 
interface. The only knowledge required is the appropriate T-matrix of these 
constituents. This topic is treated using a slightly different method in the 
next chapter. 



7 Scattered Field 



For many applications, only the far-held region is of interest, where 



The scattered far helds can be expressed in terms of a scattering amplitude 
matrix that contains all the polarization information of the scattered light: 




exppfcin] ^2^3] {E™\ 
-ifcin 



(58) 



where if™ and if;™ are the amplitudes of incident TE and TM plane waves, 
and and Eg^ are the 9i and components of the scattered electric held. 

The total scattered held is composed of the linear superposition of the scat- 
tered held from the sphere Egca and from its image Ejma- We have provided 
only an explicit expression for Ei^a when the interface is perfectly conduc- 
ting (for the dielectric halfspace, the approximate coefhcients given are only 
valid for the purpose of the interaction). However, the contribution of the 
image held in the far-held is equivalent to the image of the scattered held 
multiplied by the Fresnel rehection coefhcient Rte (tt — 9i) or Rtm (tt — Oi) , 
and a phase shift exp {—2ikdcos9i) . We can express the image of the scatte- 
red held in terms of the scattered held using (16), and (17). Separating the 
individual vector components of (22) yields the following: 



oo n 

E (— i)” exp (im(()i) X (59) 

n—1 m——n 

1 1^1 -k i?™ (tt - 9i) (-1)”+”" exp {-2\kdcos 9{) (cos 6»i) 

-k [^1 - i?™ (tt - 6»i) (-l)”+™exp(-2ifcdcos6»i) (cos6»i) | 
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oo n 

S '2 = ^ ^ (— exp X (60) 

n—1 m——n 

{ [1 + - ^ 1 ) exp (-2ifcfiC0S 6»i) (cos 6»i) 

+ 1 - i?™ (tt - 0i) (-l)"^'"exp(-2ifcdcos0i) (cos6*i) | 

00 n 

53 = ^ ^ (-i)"+^exp(iTO(/)i) X (61) 

n=l m——n 

I 1 + i?™ (tt - 0i) (-l)"'''"*exp (-2i/c(icos0i) iimn {cos 9 1 ) 

+ l--R™(7T-6ii)(-l)”+™exp(-2ifc(icos6<i)6^^2i™^mn(cos6»i) | 

OO n 

Si = E E (— i)” exp (im(()i) X (62) 

n—1 m——n 

I 1 + i?™ (tt - 0i) (-!)"■'"’" exp (-2ifcficos0i) (cos 0i) 

+ 1 - i?™ (tt - 0i) (-l)"^'"exp(-2ifcdcos0i) (cos0i) 

The superscripts on the scattering coefficients bnm^ and bnm^ are the scat- 
tering field coefficients when the system is illuminated by TE and TM plane 
waves. These expressions represent the far-held solution to the problem of a 
sphere near a plane interface. 

Scattering amplitudes are convenient, theoretically, because they can be 
manipulated to produce measureable quantities like scattering intensities and 
polarization states. Figure 2 shows the scatter in the plane of incidence (^1 = 
0°) of r = 0.55 /rm spheres resting on a substrate illuminated at 9o = 30° 
and A = 0.6328 /rm. This hgure demonstrates the sensitivity of the scatter to 
system parameters. The TM polarization intensities are especially sensitive to 
a change in the halfspace complex refractive index (m 2 ) , since the TM Fresnel 
rehection amplitudes are more sensitive than the TE amplitudes. Many sizing 
routines are based on the positions and spacing of the maxima and minima 
intensities. This graph demonstrates that care must be used when applying 
such techniques to particles located on substrates. The spacings remain fairly 
constant for the TE illumination of Fig. 2, but change drastically for TM 
illumination. 

8 Conclusion 

In this chapter we have derived a solution for the scattering from a spherical 
particle near a perfectly conducting plane interface. The analytical solution is 
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Scattering Angle (degrees) 

Fig. 2. Light scattering intensities calculated for r = 0.55 /im spheres on a substrate 
illuminated at A = 0.6328 /rm, 6o = 30°, 0i = 0°, and d — 0.55 fim. Results are 
shown when the system is composed entirely of gold (rusph = 0.226 + 3.321) and 
when the system is perfectly conducting 



in terms of a vector spherical harmonic expansion. The equations can be mo- 
dified to accommodate an irregular particle near an irregular interface if the 
scattering T-matrix is known for these isolated subsystems. Additionally, the 
equations may also accommodate a non-perfectly conducting halfspace. An 
approximate solution of this latter case, based on a normal-incident Fresnel 
reflection of the interaction held, is provided. 
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Abstract. An analytical- numerical technique for the solution of the two-dimen- 
sional scattering of an electromagnetic wave by a set of circular cylinders in the 
presence of a plane discontinuity for the electromagnetic constants is discussed. 
Since the interface is only characterized by its reflection coefficient, a wide class of 
reflecting surfaces can be treated with the same formalism. The solution is obtain- 
able for both the near and the far region, regardless the polarization state of the 
incident wave. The method exploits the possibility of representing any two-dimen- 
sional field in terms of a suitable superposition of cylindrical waves. The expansion 
coefficients represent the unknowns in a typical scattering problem and can be de- 
termined by imposing the boundary conditions. The presence of the interface leads 
to the necessity of evaluating the reflected cylindrical waves, and this is achieved 
starting from the Fourier spectrum of the cylindrical functions on a plane. 



1 Introduction 

Cylindrical waves, expressed as the product of the Hankel function of integer 
order m [1] times the factor exp(imd), t? being the angular coordinate, are 
fundamental building blocks in constructing the solution of two-dimensional 
scattering problems [2]. Due to the fact that these waves are the eigenfunc- 
tions of the two-dimensional Laplace operator, they can be assumed as a basis 
for describing solutions of the two-dimensional electromagnetic problem con- 
stituted by the Helmholtz equation and the Sommerfeld radiation condition, 
although some care has to be used in choosing the convergence domain of the 
expansions [3,4]. 

In particular, such solutions are employed to describe the scattered field 
by cylinders in free space for incident plane waves [2] or Gaussian beams 
[5-7]. Moreover, circular cylindrical scatterers may be used to simulate the 
behavior of two-dimensional objects of arbitrary transverse shape, both in the 
small radius approximation [8] and in the generic radius case [9] . Gylindrical 
functions play a fundamental role also when numerical procedures are used, 

F. Moreno and F. Gonzalez (Eds.): Lectures 1998, LNP 534, pp. 97-1 11, 2000. 
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which allow the case of arbitrarily shaped cylindrical scatterers to be solved 
[4,10,11], 

In all the problems involving cylindrical structures, one has to resort to 
a suitable representation of the scattered field. A very common approach 
[12] is to consider this field as a linear combination of the above mentioned 
cylindrical functions. The coefficients of such expansions are the unknown of 
the problem and are determined by imposing the boundary conditions on the 
surfaces of the cylinders. 

The problem becomes more complex when a planar interface is present 
near the cylindrical scatterer. This is the case, for instance, of the radar 
detection of objects near the ground [13], the design of diffractive optical 
gratings [14], the study of quasi-optical launchers of lower-hybrid plasma- 
waves [15-19], and the study of surface contaminations in the semiconductor 
industry [20]. 

Several solutions can be found in the literature for problems of this kind. 
For example, an infinite wire grid parallel to the interface was treated by 
Wait for cases of both a perfectly conducting plane and a vacuum-dielectric 
interface [21-23]. Furthermore, for the case of cylinders in front of an in- 
terface, the problem has been approached with different techniques, using 
methods based on integral equations [13], coupled dipoles [24], the extinction 
theorem [25-28], or the image theory [29]. When the surface coincides with 
the interface between vacuum and a dielectric homogeneous medium or it is 
a real conducting plane, numerical methods are available also for non cilin- 
drical objects [30-33]. In addition, partially buried scatterers have recently 
been considered [34,35]. 

In general, when a planar interface is present near the cylindrical scatteres, 
the cylindrical symmetry is broken and an approach based on cylindrical 
waves seems to be less natural. Nonetheless, cylindrical functions assume 
an important role if use is made of their Fourier spectrum, inasmuch as the 
presence of the reflecting surface can be taken into account by considering the 
reflection of each elementary plane wave. This is the approach used by Wait 
[21-23] and Clemmow [36], but in their case the diameter of the cylinders is 
much smaller than the wavelength, so that only the cylindrical wave of zero 
order is needed to describe the scattered field. 

Here, we report the theoretical basis of a method, based on the use of cy- 
lindrical waves, for treating the two-dimensional scattering by a set of parallel 
cylindrical scatterers with arbitrary size, in the presence of a general, planar 
refiecting interface. In Sect. 3 the study of the scattering of a plane wave by 
perfectly conducting cylinders is treated, while in Sect. 4 the generalization 
of the method to dielectric (possibly lossy) diffractive cylinders is given. Sec- 
tion 5 concerns the case in which a general two-dimensional field is considered 
as the incident field. Finally, in Sect. 6 some examples of application of the 
method are given for cases of practical interest. 
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Fig. 1. Fields involved in the scattering process. 



2 Preliminaries 

From a mathematical viewpoint, the problem consists in solving the two- 
dimensional Helmholtz equation in the open domain constituted by the half- 
space to the left side of the planar interface, outside the cylinders (see Fig. 1). 
Boundary conditions on the various discontinuity surfaces have to be impo- 
sed by taking the electromagnetic properties of the interacting bodies into 
account. As said before, the key for solving this problem is the use of the 
angular spectrum of the cylindrical waves, which has been recently found for 
an arbitrary-order cylindrical wave [37]. Indeed, by using such a tool, the 
reflection of a typical cylindrical wave by a generally reflecting plane surface 
can be evaluated. The price to be paid is the evaluation of some integrals 
of oscillating functions. In most cases, these integrals have to be calculated 
numerically and this represents a computational bottle-neck of the method. 
However, the presented analysis offers a series of advantages, as will be clear 
in the following. 

Before entering the mathematical details of our procedure, we want to 
outline in an intuitive way the fundamental ideas constituting the basis of 
the method. Essentially, the total electromagnetic held resulting from the 
multiple interaction process is considered as the superposition of several con- 
tributions, which are: 

• Vi', held due to the incident plane wave; 

• Vr'. held due to the reflection of the incident plane wave by the planar 
surface; 

• Vd'. held diffracted by the cylinders; 

• Vdr- held due to the reflection of Vd by the planar surface; 
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• Vc'. total field inside the diffracting cylinders. 

Each of these contributions have to be expressed by means of suitable 
modal expansions. Since the diffractive structure consists in one or more 
circular cylinders with parallel axes, each of the above fields will be expanded 
as a superposition of cylindrical waves centered on the cylinder axes. 

3 Scattering from Perfectly Conducting Cylinders 




Fig. 2. Geometry of the scattering problem. 



In Fig. 2 the geometrical layout of a typical scattering problem is shown: JV 
perfectly conducting circular cylinders with radii a* (t = 1, . . . , JV) are placed 
in front of the reflecting surface. The axis of each cylinder is orthogonal to 
the plane and the structure is assumed to be infinite along the y direction, 
so that the problem is reduced to a two-dimensional form. In the following, 
for brevity, dimensionless coordinates ^ = kx, ( = kz are used. V (^, C) 
represents the component of the electromagnetic field parallel to the y axis, 
i.e., V = Ey for TM polarization and V = Hy for TE polarization. In this 
section we suppose that the incident field is a monochromatic plane wave 
whose wavevector, say k*, lies in the plane and {p is the angle between 
the wavevector k* and the ^-axis. In Sect. 5 we will generalize the incident 
field by considering a general, two-dimensional, monochromatic field. The 
presence of the planar reflecting surface is taken into account by means of its 
complex plane-wave reflection coefficient E (u||), where n = k//c, k being the 
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wavevector of a typical plane wave incident on the surface, and fc = 27 t/A is 
its wavenumber. Here and in the following, the symbols _L and || refer to the 
orthogonal and parallel components of a vector with respect to the interface, 
respectively. The time dependence of the field is assumed to be exp(— twt), 
where lo is the angular frequency, which will be omitted throughout the paper. 

Let Pi = (t = 1, • ■ • ,N) be the position vector of the center of 

the tth cylinder in the main reference frame (0,5, C) {MRF from now on), 
and Pi = (5i, Ct) (t = 1, • ■ • , iV) be the position vector in the frame centered 
on that cylinder (RFt). It is convenient to choose the 5 axis of MRF lying 
on the reflecting surface. 

As far as the incident held Vi is concerned, the following expansion, in 
terms of cylindrical waves centered on RFt, holds [38]: 



Vi (5. C) = Vo exp exp + in[|Ct 



+ 00 



Vo exp ^ exp {-irntp) Jm (pt) exp (irri'dt) 



( 1 ) 



where {pt,'dt) ore polar coordinates in RFt, and use has been made of the 
expansion of a plane wave in terms of Bessel functions Jm [!]• Equation (1) 
gives the held associated to the point having coordinates (5, C) MRF as a 
function of the coordinates in RFt- 

A similar expansion can be given for the reflected field Vr, i.e. [38], 

Vr (5, 0 = Vo F exp + zn[|C?) exp + znf|0 



= VoF 



(nfi) exp (-mi 5? + znf|C?) 



(2) 



+ 00 



X E z™ exp {-irrup) Jm {pt) exp {im'dt) 



m— — cx) 



where p = tt — p is the angle of propagation of the reflected plane wave (see 
Fig. 2). 

The diffracted field Vd can be expressed as a sum of cylindrical functions 
with unknown coefficients Csm (s = Ij-- - zn = 0,±1,±2,...) in the 
following way [39]: 



N 



+ 00 



Vd (5, C) = 1^0 E E (-*™V5) Csm CWm (5s, Cs) 

s—1 m— — (X) 

where CWm (5s> Cs) is the cylindrical function 

CW,„ (5s, Cs) = H^'> (ps) exp (zm-ds) , 



(3) 



(4) 
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hIt^ is the outgoing Hankel function [1], and (^s,Cs) are rectangular coordi- 
nates corresponding to {ps,'&s) (see Fig. 2). The last term to be analyzed is 
Vdr- In [39] it was shown that the field due to the reflection of the cylindri- 
cal wave CWm {^s, Cs) by the interface can be written as RW^ (2x« — Cs)? 
RWm (^, Q being the function deflned as 



1 

KW™(C,C)=^y I" (ny) (^,n||) exp (inyC) dny, (5) 

where Xs = denotes the distance between the axis of the s-th cylinder 
and the reflecting surface (see Fig. 2), and Fm (^, ny) is the angular spectrum 
of the cylindrical function CWm, deflned as 



/ + 00 

CW,„ (^, C) exp (-inyC) dC, 

— oo 



whose explicit expression is [39,37] 

2 exp (in±^) 



ny ) — 



n_L 



exp (—7 



im arccos 



ny) . 



( 6 ) 



(7) 



By considering the reflection of each cylindrical wave and summing all 
the contributions, from (3) we obtain 



N +00 



Vdr (C, C) = 1^0 X! X! exp(-zmv3) Cs^RW^(2xs-6,Cs)- (8) 



m— — oo 



Equation (8) shows that the diffracted-reflected held can be thought of 
as the superposition of flelds diffracted by “image” cylinders centered at the 
points (— ^°, Cs)) for s = 1, . . . , A^. These flelds are modulated by the presence 
of the interface through the definition of RWm functions [see (5)]. 

Finally, due to the hypothesis of perfectly conducting cylinders, the held 
must vanish inside them, so that Vc (C>C) = 0- 

In this case, the boundary conditions are expressed by the following equa- 
tions: 



Vi + Vr + Vd + Vdr\p^^kat—^’ (*^) 

for TM polarization, and 

+ dp,Vr + dp^Vd + dp,VdAp^=ka. = 0. (10) 

for TE polarization. After some algebra, the following linear system for the 
unknown coefficients Cge is obtained [38]: 
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N +00 






where 

K, = i^exp{-U^) 



/to = 0, ±1, ±2,.. A 

V J’ 



( 11 ) 



X { [CW^_m (Cstj Cst) (1 ~ '^st) + l|+m] An (fcot) + , 

( 12 ) 



I,%^ = RW,+^ (x,. + Xt,(t-0, 



(13) 



exp TA (kat) 



exp (i exp {—irmp) + F exp (— i exp (—irmp) 



(14) 

where the function (a^), which brings information about the polarization 
state of the incident field, is defined as [39] 



An (a;) 



Jva (a^) 
(a;) 

An (a^) 

. (x) 



for TM polarization, 



for TE polarization. 



(15) 



the prime denoting derivation. The solution of the system (11) leads to the 
evaluation of the unknown coefficients Cs£ and then to the complete determi- 
nation of the total field both in the near and in the far region [38]. 

Some words have now to be spent to illustrate the main computational 
advantages and limitations of the present method. Although no approxima- 
tions have been introduced in the theoretical basis of the method, in order to 
implement it we have unavoidably to truncate the series in (11). Physically, 
this means that the field diffracted by the sth cylinder is expressed by a finite 
sum of CWm functions [see (3)], letting m run from —Mg to Mg, Mg being 
the truncation order for the sth cylinder. As a matter of fact. Mg depends 
on the size of the sth cylinder [38] and therefore the algorithm complexity 
grows with the size and the number of cylinders. 
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An important problem to be considered is the numerical evaluation of the 
matrix elements [see (13)], which actually represents the most limiting 
factor of the method, at least from a computational point of view. Indeed, 
these functions are defined by means of integrals of oscillating functions [see 
(5)], whose amplitude and frequency strongly depend on the order £ -\- m 
and on the mutual distances C? — Cs = 1,2, ... ,N), respectively. As 
can be shown [38] , the maximum value of £ + m is related to the radius of 
the cylinders, while the transverse (i.e., in the C direction) extension of the 
whole diffractive structure fixes the maximum value of C? ~ C«- High values 
oi £ + m or C® — make the evaluation of the involved integrals rather heavy, 
and special integration routines, e.g., based on a combined use of Gaussian 
techniques and extrapolation methods [40], have to be used. We tested the 
algorithm for values of Mg up to some tens, corresponding to cylinders whose 
diameters are of the order of some wavelengths. It should be stressed that in 
such cases neither the Rayleigh nor the Kirchhoff approximation (valid for 
very small or very large objects, respectively) can be used. 



4 Generalization to Dielectric Cylinders 

The theory developed in the previous section deals with the scattering pro- 
blem by a set of perfectly conducting cylinders. However, in many cases such 
a model does not work properly. This happens, for example, in the case of 
TE polarization, where the hypothesis of infinite conductivity is often not 
valid, as well as in dissipative structures. 

Essentially, the generalization to the dielectric (possibly dissipative) case 
consists in considering also the field present inside the tth dielectric cylinder, 
denoted by V*, which has the form [3] 

+ 00 

(C. C) = ^0 X! (~*w) dtm Jm (nlpt) exp(tmdt), (16) 

m——oo 

where dtm (t = 1,... ,N) are further unknown coefficients and n* (t = 
1, . . . , N) are the complex refractive indexes of the cylinders. In such a case, 
boundary conditions are defined in a more complicated way than in the case 
of perfectly conducting cylinders [see (9)], because the continuity of both the 
electric and the magnetic field are to be imposed. This leads to a pair of 
equations for each polarization state, i.e., 

( + Vr + Vd+ Vdrlp^^kat = '^c\pt=kat > 

{ (17) 

[ + dptVr + dp^Vd + “ 'I* , 

where qt = 1 for TM polarization and qt = (l/n*)^ for TE polarization 
(t = 1, . . . , N). By proceeding in the same way as for the perfectly conducting 
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case, the following coupled linear systems are derived [41]: 



N +00 



/ -^rni ^tra — -Om 5 



£— — 00 



E \^ T _ 0^(2) 

/ -^rni ^tm — -Om 5 

S —1 i— — 00 

where superscripts (1) and (2) refer to the boundary conditions on the field 
and its normal derivative, respectively. In particular 



Hm (kat) 



with pt defined as 



{ n* for TM polarization, 

1/n* for TE polarization. 

Furthermore, the coefficients of the linear systems (18) are 
= i^expi-Up) 

X I [CW^_m (^st, <Cst) (1 — 6st) + ^ (kat) + SstSim'^ 



i!l^ = RW,+r„ {x., + xtXt-Cs), 



= -i""exp (*n||C°) (kat) 

X exp exp {—imp) + F exp exp {—imp) , 

(24) 

and the functions Tin ' {x) are defined as 

„( 1 ) X X _ Jm {x) 

B«(x)’ 



(25) 
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rr(2) / 'I ^ (^) 



(a;) 



(26) 



A way to solve the systems (18) is to eliminate the coefficients dtm, thus 
obtaining the following linear system for the sole Csi coefficients 



N +00 
^ ^ D^^,Csi = L 

s=l £— — <yD 



t 

m 5 



(27) 



where 



]-)St /^i(2) 

^m£ -^ral ’ 



(28) 



LL = 



m m 



(29) 



Finally, once the Cqe coefficients are known, it is possible to evaluate the 
internal field by means of the dtm coefficients in the following way: 



df/m — 



1 






N +00 

Bff'-Br-E E [ 

5 = 1 £— — oq 



.si(l) j,st(2) 



^si 



(30) 



5 General Incident Fields 

Once the plane-wave scattering by a set of cylinders has been well establis- 
hed, an extension to treat the case of a generic two-dimensional impinging 
field can be performed. This case is particularly useful when the plane-wave 
approximation is not valid, for example when the field has been formed by an 
antenna or an optical system and the scattering elements are in its near-field 
or intermediate field region. 

The general lines closely follow the plane-wave case. Assuming the CWm 
functions as the basis for the solution of the Helmholtz equation with Som- 
merfeld radiation condition, use of (3), (8), and (16) can be made also in this 
case: therefore, it is sufficient to give suitable expressions for the fields Vi and 

V. 

Now, we assume that the y-component of the incident field is represented 
by means of the Fourier integral 



C(o,o 



27T 



r+oo 



Uo{p) exp{iC'p)dp. 



(31) 
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Fig. 3. Plane-wave decomposition of a general incident field. 



In order to easily handle different incident fields, we introduce a further 
reference frame, Incident Field Reference Frame (IFRF, for short). In Fig. 3 a 
Gaussian profile is sketched as an example. In (31), p is the conjugate variable 
of and Uo{p) is the angular spectrum of the incident field. A change of 
reference frame from IFRF to MRF allows us to obtain the expression of V) 
as [18] 



V ^{£„0 



2tt 



r~\-oo 



7T / Go(p)exp{i[(C-Co)cos'!/'p+(C-Co)sinV'p]}dp, 



(32) 



where (CojCo) are the coordinates of the IFRF origin, use has been made of 
the angle (p formed by the and ^ axes, and of the angle 'tpp = arcsinp -|- p 
between the propagation direction of the generic plane wave of the spectrum 
(32) and the ^ axis. Across the plane ^ = 0 we have 



+ 00 



v;(o,c) 



Vo 

27T 



— oo 



Uo{p)r{sm tpp) exp{i[-^o cos ipp + {C - Co) sin ipp]}dp. 

( 33 ) 
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Letting the plane- wave components of 14-(0, C) propagate in the half-plane 
^ < 0, we obtain the field C) as follows: 

TA /*+00 

i4.(c,c) = ^y Uoipmsin^pp) 

X exp{i[-(^ -h ^o) cos V’p + (C - Co) sin -)pp]}dp. (34) 

By performing the same procedures as in the plane-wave case, we obtain a 
linear system of the same form as in (11) (or (18)), with the same coefficient 
matrix, but whose matrix B is modified as follows: 



BL = -V 



2tt 



+00 



VoAp) exp[i(C° - Co) sinV'p] 



X {exp[i(^° - ^o) cosf/'p] exp(-imV’p) 



(35) 



+r{sintpp) exp[-i(^° -h Co) cos V'p] exp(-imV’p)dp} , 
with Tpp = TT — Tpp. 



6 Applications 

Several practical problems can be studied through the above formalism, and 
some of them have already been considered. The interaction between two 
perfectly conducting cylinders placed on a perfect mirror was analyzed in 
[38], where the effect of the polarization of the incident field on the far-field 
features was highlighted, together with the influence of the cylinder-cylinder 
distance. Furthermore, the case of a finite diffraction grating placed near a 
vacuum-dielectric interface was also treated. The grating consists of N equi- 
distant identical cylinders, whose axes lie on a plane, parallel to the reflecting 
surface. This structure is of considerable importance in optics and microwa- 
ves, as well as in the study of some aspects of the light-matter interaction like, 
for instance, enhanced backscattering [42] and surface-polaritons interaction 
[43]. Another important application of the present formalism is relevant to 
the additional microwave heating of toroidal plasmas [44]. In particular, the 
study of a finite diffraction grating turns out to be useful in the analysis 
of the so-called quasi-optical grills. In this case, it is of great importance 
to launch a quite narrow spectrum of evanescent waves inside the magneti- 
zed plasma column. Only these waves, indeed, propagate inside the plasma 
in the slow-wave lower-hybrid regime and can contribute to the heating of 
the internal plasma regions. In [15,18] several configurations of quasi-optical 
launchers have been analyzed, and the corresponding performances in terms 
of coupling parameters were given, together with the electric field map in- 
side the structure. It is worth stressing that this requires a complete and 
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precise knowledge of the scattered field in the near-zone. As far as the case 
of dielectric cylinders is concerned, the plane-wave scattering from a single 
Si02 cylinder placed on a fiat silicon substrate was reported in [41], where 
the intensity inside the cylinder was also presented. Researches are currently 
in progress on the study of the interaction of surface waves with cylindrical 
objects. 
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Abstract. The problem of light scattering by a particle on or near a surface is 
treated using the field decomposition method and the free-held T-matrix method. 
The model takes into account that the incident held strikes the particle either 
directly or after interacting with the surface, while the helds emanating from the 
particle may also rehect off the surface and interact with the particle again. The 
rehection of the scattered and incident helds are determined in the context of a 
T-operator formalism which enables us to compute scattering by inhnite surfaces. 
The solution for a plane interface is obtained as a special case of this general 
formalism. An approximate solution is also given by assuming that the scattered 
held, rehecting oh the surface and interacting with the particle, is incident upon 
the surface at near-normal incidence. The range of validity of the approximate and 
rigorous method is checked from a numerical point of view. 



1 Introduction 

Particle contamination characterization of a silicon wafer surface is of great 
importance in semiconductor manufacturing. As semiconductor device di- 
mensions become smaller, there is a need for wafer surface inspection systems 
to detect the size of microcontaminations to as low as 0.1 /rm or even smaller. 
To expand the current detection ability an efficient mathematical model and 
computer simulation technique is needed. 

Calculation of light scattering from particles deposited upon a surface is 
of great interest in the simulation, development and calibration of a surface 
scanner. Several studies have addressed this problem with the use of widely 
differing methods. 

Some simplified theoretical models have been developed on the basis of 
Mie light-scattering theory and Fresnel surface reflection [1-4]. These appro- 
aches represent an extension of Mie theory and are focussed on the light 
scattering problem of a sphere on or near a plane surface. A coupled-dipole 
algorithm has also been employed for the scattering problem [5,6]. In this 
case a three dimensional array of dipoles is used to model a feature shape 
and its composition. In order to take the presence of the unbounded space 
into account the Sommerfeld integrals for interaction between a dipole and a 
surface are introduced. 

A model based on the discrete source method was given by Eremin and 
Orlov [7,8]. In this approach the transmission conditions at the half-space 
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interface are taken into account in an analytical manner so that the algorithm 
does not need to satisfy them numerically. This is done by using the Green 
tensor components for the interface to construct the fields of discrete sources. 

Similar scattering problems were considered by Kristensson and Strom 
[9], and Hackmann and Sammelmann [10] in the context of the extended bo- 
undary condition method. Acoustic scattering from a buried inhomogeneity is 
discussed by Kristensson and Strom by assuming that the free-held T-matrix 
of the particle modihes the free-held T-operator of the arbitrary surface. By 
projecting the free-held T-operator of the surface onto a spherical basis the 
authors derive an inhnite system of linear equations to determine the coupling 
of the free-held T-matrix of the particle at each order of perturbation theory. 
In contrast, for the acoustic scattering from an elastic shell in a waveguide 
Hackmann and Sammelmann consider that the free-held T-operator of the 
surface modihes the free-held T-matrix of the particle. In this context, the 
free-held T-matrix of the particle is projected onto a rectangular basis and 
an integral equation for the spectral amplitudes of the helds even in zeroth 
order perturbation theory is derived. 

In the present contribution we use the held decomposition method and 
the free-held T-matrix approach [11] to analyze scattering from a particle 
situated on or near an inhnite surface. The method takes into account that 
the incident held strikes the particle either directly or after interacting with 
the surface, while the helds emanating from the particle may also rehect oh 
the surface and interact with the particle again. The rehected scattered held 
or the interacting held is computed in the context of a T-operator formalism 
using the integral representation of spherical vector wave functions over plane 
waves. For these helds we assume expansions in terms of regular or radiating 
spherical vector wave functions. The transition matrix, or the free-held T- 
matrix relates the expansion coefhcients of the incident helds to the scattered 
held. However, for some special scattering conhgurations such an approach 
is expected to fail as a consequence of the geometrical constraints of the T- 
matrix method. The expansion of the scattered held in terms of radiating 
multipoles is valid outside the circumscribed sphere of the scatterer, and in 
general this fact generates restrictions in applying the T-matrix method for 
analyzing multiple scattering problem. For example, in the case of an oblate 
particle situated upon a wafer surface the circumscribed sphere intersects the 
plane interface, and consequently the geometrical constraints of the T-matrix 
method are violated. 

2 Geometry of the Scattering Problem 

The geometry of the scattering system is shown in Fig. 1. An axisymme- 
tric particle is situated above an inhnite surface S. The surface S separates 
the two half-spaces Dq and Z?i, and is assumed to be sufhciently regular for 
application of Green’s theorem. We choose the origin of a rectangular coor- 




7 T-Matrix Method for Light Scattering from a Particle on Surfaces 



115 



dinate system Oxyz at a distance zq above the mean plane E of the surface 
S, and assume that the surface is bounded by two parallel planes z = z> and 
z = z^. The normal to these planes defines the z axis. We assume that the z 
axis is directed into Dq and coincides with the particle symmetry axis. The 
half-spaces Dq and D\ are considered to be isotropic, homogeneous, and non- 
magnetic, i.e. yo = = y- The incident radiation is a plane wave traveling 

in the xz plane, oriented at angle Pi„c with respect to the z axis. The wave- 
numbers in each region are denoted by kg = uj{eQn)^^‘^ and fci = 
respectively. 




3 Scattering by Infinite Surfaces 

Before analyzing the multiple scattering problem we digress and present in 
this section the fundaments of the T-operator formalism for solving the scat- 
tering problem by infinite surfaces. This approach will be used latter for 
computing the scattering characteristics of an axisymmetric particle in the 
presence of an infinite surface. Our presentation closely follows the guidelines 
given by Kristensson [12]. 
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3.1 Notations and Mathematical Formulation of the Scattering 
Problem 

Let us consider the scattering problem of an infinite surface S illuminated by 
the incident wave field (Ei„c(r), H^^^(r)). The mathematical formulation of 
the boundary value problem consists in Maxwell’s equations 

V X Ep(r) = jw/iHp(r) , , 

V X Hp(r) = -ja;epEj,(r), p = 0, 1 ^ ’ 

the boundary condition at the surface S 

n X [Eo(r) + E„^(r) - E,(r)] = 0 , . 

nx[Ho(r) + H_(r)-H,(r)]=0 

and the radiation/attenuation conditions for the fields in the half-spaces at 
infinity. Here, n is the unit normal vector to the interface pointing into region 
Dq. Assuming Jm(fco i) > 0 then, there is a unique solution of the boundary 
value problem (l)-(2). 

In the following we will extensively use representations of electromagnetic 
fields in terms of vector plane waves. We briefly recall the definitions and the 
basic properties of scalar and vector plane waves [13]. The scalar plane wave 
is defined by 

x(r,K±) =exp(jK± • r) =eyi]y[j{K^x + Kyy ± K^z)] (3) 

where K± = + KyQy ± K^e^- Using the notation A = + K^, 

can be expressed as = \/fc^ — A^, where the square root is always chosen 
to have a positive imaginary part. For real k , Kz is given by Kz = Vk^ — A^, 
if X < k, and by Kz = — k'^ if X > k. The case X < k corresponds to 

harmonic propagating waves, while the case X > k corresponds to evanescent 
waves. 

The vector waves M(r, K±) and N(r,K±) are defined in terms of the 
scalar plane waves as 

M(r, K±) = V X [e,x(r, K±)j /A = j(KT/|KT|)x(r, K±) 

N(r,K±) = V X M{r,K±)/k = - [(K±/|K±|) x (Kt/|Kt|)] x(r, K±) 

( 4 ) 
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hold on a plane S, z = const, where |K±p = |Kj_p = M(— r, Kj_) = V x 

I^±)] N(— r, K±) = Vx M(— r, K±)/k. We note here the al- 

ternative expressions for the vector plane waves: M(r, K±) = — jecx(f;K±) 
and N(r,K±) = — e/ 3 x(fjK±), where (efc,e^,ea) are the spherical unit vec- 
tors in the K-space. 

Now, let us apply the Huygens principle or the extinction theorem to 
a surface consisting of a finite part of S and a lower half sphere. Let the 
radius of the sphere go to infinity and assume the integrals over S exist, and 
furthermore that the integrals over the lower half-space vanish (radiation 
conditions). We get 



E(r) 

0 



Ei„c(r)+ / nxEi(r) -V xGgo(r,r) 



-h 



nxVxEi(r') •Ggp(r',r) dS , 



r gL>i 



( 6 ) 



u , 

Here, E(r) = E-^^(r) -|- Eg(r) is the total field in region Dq and Ggg(r ,r) 
is the free-space dyadic Green function of wavenumber kg. The superscript 
(0) indicates that the free-space dyadic corresponds to region Dq- Since the 
T-matrix formalism is based on suitable expansions of the Green’s dyadic we 
use the plane wave expansion. This type of expansion has been extensively 
analyzed in the literature [12,13] and is of the form 



=0 , 
Geo(i' t) 




M(-r' , K° ) M(r, K° ) -H N(-r' , K°_) N(r, K° ) 



KO 

( 7 ) 



when 2 > 2 : , and 



=0 , 

Geo(l' T) 




M(-r', K° ) M(r, K° ) -h N(-r', K° ) N(r, K'].) 



dK^.dK^y 

( 8 ) 



when z < z . 



3.2 T-Operator Formalism 

We now turn to the derivation of a set of basic equations which will be used to 
compute the scattered field. The entire analysis can conveniently be broken 
down into the following steps: 

(I) The external problem consisting in the null-field condition (6), for 
r GDi, is analyzed. This problem is called the external problem, because the 
fields external to S are the same as in the original problem. The null-field con- 
dition asserts that for all points below the surface, the field due to the sources 
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distributed over the surface must exactly extinguish the incident wave. Ma- 
thematically, the null-field condition gives an expression for the unknown 
surface current densities in terms of the incident wave. These equations are 
put in a form suitable for numerical solution by expansion of the various field 
quantities into plane waves. 

(II) Once the surface current densities are determined the scattered field 
in the region Dq is obtained by using the field representation (6), for r G Oq. 
We note that (6) simply asserts that, above the surface, the total field is the 
sum of the incident field and the radiation due to the sources distributed over 
the surface. 

We begin by considering the null-field condition in the region D\ and 
represent the incoming field as a superposition of plane waves, i.e. 

/■ r n n n n n dK^dK° 

E„e(r) = J [A(K° ) M(r, K° ) + B(K° ) N(r, K°_ )] (9) 

Assume that the integral representation is valid at least for z < z<. The 
incoming field is as usual a prescribed field, whose sources are assumed to 
be situated in Dq. This means that in any case the sources are above the 
fictitious plane z = z<: and an expansion of the incident field as in (9) can be 
found for a wide class of sources. 

For an r GDi satisfying z < z<, we insert the plane wave expansion of the 
Green’s dyadic in (6) and due to the orthogonality properties of the plane 
vector waves (5) we get 



A(K° ) 



B(K° ) 






n X Ei(r ) • N(-r ,K‘(.) 



a 

+{l/ko) nxVxEi(r') • M(-r', K°l)} 
[n X Ei(r')j • M(-r',K° ) 



dS 






a 

-k(l/fco) nxVxEi(r') • N(-r', K0_)} dS'. 



(10) 



The next step of the T-matrix formalism is the expansion of the surface 
current density n x Ei(r ) into a suitable complete sets of functions. By 
means of these expansion, an algorithm for eliminating the surface fields can 
be constructed. In this context we use the well known result which states 
that the tangential plane vector waves {n x M(r,K?_),n x N(r, K^)}, with 
KxiKy G are complete on L'^{S) [9,12]. Hence, we represent the surface 
current density as a superposition of down-going plane waves, i.e. 



nxEi(r') = J {C(K]_)c(K]_) [nxM(r,KL)] 

+ D{Kl)d{Kl) [nxN{r,Kl)]}dKldKl. 



( 11 ) 
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We explicitly write the plane wave transmission coefficients in (11), 

c = 2X°/(X0 + Kl) exp[-j(ifO - Kl)zo] .. 

d = + Kl) exp[-j(i^° - Kl)z^] 

where m denotes the relative index of refraction of the half-space D\, and 
the wave vectors K° and K)_are related to each other by the Snell’s law: 

A = Ao = Ai = + (i^0)2 = ^(^1)2 + (^1)2, (13) 

This expansion can now be inserted into Eq.(lO) and we get 

= j [0(K°,KD] dKldKl (14) 

where 



[Q(K°,Ki)] 



■qii(K°,K^) Q12(ko,k^)- 
_Q21(ko,K^) Q22(ko^K)_) 



(15) 



and 



gii(K').,K)_) = -£|c(K):) J I nxM(r',Ki) -N(-r',K°) 

s 

+m^ nxN(r',KL) • M(-r', K0.)| dS” 

gi2(K0^,K)_) = -£|d(KL) J I nxN(r',K)_) •N(-r',K°) 

s 

+m^ [n X M(r',K)_)j ■ M(-r' ,K°)\ dS 

2 /• , p ^ , (16) 

g2i(K').,K)_) = -|^c(KL) / I nxM(r,K)_) •M(-r,K°) 

s 

+m^ nxN(r',K)_) • N(-r', K°_)} dS” 

g22(K<).,K)_) = -|§d(KL) J { nxN(r',Ki) •M(-r',K°) 

s 

+m^ n X M(r',K)_) • N(-r', K')_)| dS” . 

An inspection of (16) reveals that g“^(K° ,K)_), a, P = 1,2, are in ge- 
neral not functions in the usual sense and relations involving g“^(K')_, K)_) 
should be understood in a distributional sense. For a plane surface we have 

gii(P'“”")(K° ,Ki) = g22(p'“"")(K° ,Ki) = {koKl)5{Kl - K°,kI~K°) 

(17) 




120 



T. Wriedt and A. Doicu 



and 



Ql2(P'a»e)(K° ,Ki) = Q21(piane)(j^0 ^ ^ 

In this case the system of integral equations (14) reduces to a system of 
algebraic equations 

A{K°_) = {koK°)C{Kl) 

B{K°_) = {koK°)D(Kl) ^ ^ 



where the wave vectors K'). and K)_are related to each other by the Snell’s 
law. Obviously for solving the general case of an arbitrary surface, one has 
to invert a system of two-dimensional integral transforms and this usually 
constitutes a formidable analytic and numerical problem. We can formally 
assume that the inverse integral transform exists and consequently the solu- 
tion to (14) may be written as 



C(K^) 

D{Kl) 



= I [g(K0_,Ki)] ' 



A(K° ) 



dKldK^. 



(20) 



We consider now the expression of the scattered field in the region Dq 
given by (6). Expanding the dyadic Green function in terms of plane waves 
and inserting this expansion in Eq. (6) we get, for z > z> , 

r dK^dK^ 

Eo(r)= j [F(K°)M(r,K^) + G(KO)N(r,K°)] . (21) 

The vector amplitudes E(K(|.) and G(K(|.) are given by 



E(K°) 



G(K°) 




+(l/fco) 



n X Ei(r')j 

nxVxEi(r 

n X Ei(r') 

|"nxVxEi(r 



•N(-r ,K°) 

')] .M(-r ,K°)} 
•M(-r ,K^) 

')] .N(-r ,K°)} 



dS 



dS. 



(22) 



Using the representation of the surface current densities n x Ei(r ) in 
terms of down-going plane waves (11), we obtain a relation between the am- 
plitudes of the field in the region Dq, E(K([) and G(K‘|_), and the amplitudes 
of the field in the region Di, G(K)_) and Z?(K)_), i.e. 



E(K°) 

G(K°) 



= I [P(K°,Ki)] 



G(KL) 

D{Kl) 



dKldKl 



(23) 
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P“(K^,KL) = glc(KL) J {[nx M(r ,Ki)] .N(-r ,K°) 

s 

+m^ nxN(r',K:L) • M(-r', K0_)| dS' 

pi2(K0,KL) = gd(K^) y'{[nxN(r',KL)] •N(-r ,K°) 

s 

W nxM(r',K^) • M(-r', K° ) J dS' 

p2i(K°,KL) = gc(KL) y'{[nxM(r ,Ki)] .M(-r',K0) 

s 

+m2 nxN(r',K:L) • N(-r',K0,)| dS' 

P22(K0,KL) = gd(K^) y'{[nxN(r',KL)] •M(-r ,K°) 

s 

+m2 nxM(r',K:L) • N(-r', K0_)| dS'. 

Combining (20) and (23) we are led to 

>(K°)' 

_G(K°)J 

= J |[P(K^,Ki)] [g(K'_°,Ki)]-' dRyxyKldKl 

= J [T(K°,K'°)] dK'yx'^^ 

where 

[T(K°,K'°)] = J [P(K^,Ki)] [Q{K'^,Kl)]-yKldKl (27) 

is the T— operator of the infinite surface. For a plane surface we have 
pii(piane)(KO ,K^) = (fcoi^°)a(K° )<5(iFi - KlKl - K^) 

and 

P12(P'“"^)(K!|., Ki) = p21(piane)(j^0^ ^ (29) 
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Here, 



a(K° ) = (ifO - K\)l{Kl + K\) exp(-2jXOzo) 

&(K° ) = (rn^Kl - K\)l(m^Kl + K\) exp(-2jX^o) 

are the reflection coefficients for a plane interface. In this case we get 

F(K° ) = (fcoif°)o(K° )C(KL) = o(K° )A(K° ) 

G(K° ) = (fcoif°)6(K° )I?(KL) = b{Kl)B{K°_) 



and clearly, 

rii(P'“”")(K° ,K'0) = a(K° )<I(A:0 - K'°,K° - K'°) 
r22(p'“"«)(K° , K'O) = b{Kl)6{K° - K'°, K° - K'°). 



In (31) the wave vectors K((, K)|_ and K)_ are related to each other by 
Snell’s law. Actually, if K*). is an arbitrary incident wave vector, then K(|_ and 
K)_ represents the reflected and the transmitted wave vectors, respectively. 

We conclude this section by noting the expansion of an electromagnetic 
held expressed as a superposition of plane waves, i.e. 



Eo(r) 



= I [F(K^)M(r,K°) + G(K°)N(r,K^)] 
w- 



dK°dK° 

koK° 



(33) 



in terms of regular spherical vector wave functions. We assume that the in- 
tegral converges for z > Zmin, where Zmin < 0, and transform the integration 
over the rectangular components of the wave vector into an integration over 
polar angles, i.e. 



2-n- 

Eo(r) = — y J [jE(a, /3)ec -I- G(a, /3)e^] exp (K° • r) sin /3d/3da. (34) 

0 c+ 

Here K(|_ = (ko sin /? cos a, ko sin [3 sin a, ko cos /3) , and the complex angle 
[3 belongs to the curve G+ . The contour G+ corresponds to an integration 
path in the complex plane from 0 to 7t/2 — joo. Each plane wave in the 
integrand (34) can be expanded in terms of spherical vector wave functions. 



exp (K° • r) = — 






oo n 

E E 

n=l m——n 

( Jr^l(/3) ' 
ym7r|r'(/3)_ 






, rk '(/?) / 






(35) 



where ttIT’' ( 0) = pJi™' (cos d)/ sin 0, (0) = dP^^cos 9) / d9 , and Pi™' (cos 0) 

are the associated Legendre polynomials. This expansion is valid not only for 
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propagating waves, but also for evanescent waves when the analytic conti- 
nuation of the Legendre functions for complex arguments is evaluated on the 
physical Riemann sheet. Consequently, the desired spherical waves expansion 
is 



n—1 m——n 



where 



^mn 



b 



mn 




F(a,/3)TH(/3) + G(a,/3)m7rjrl 

F(a,/3)m7rlrl(/3) + G(a,/3)rH 



iP) 

iP) 









sin (3d(3da 



sin (ddfdda. 



(37) 

Note, that if the radius of convergence for the series (36) is i?max, then (36) 
is a valid representation for the electromagnetic field (33) if i?max < |-Zmin|- 



4 T-Matrix Method 

In this section we consider the multiple scattering problem of a particle in 
the presence of an infinite surface. Using the field decomposition method we 
simplify the boundary value problem to single scattering problems. 



4.1 Formal Solution for an Arbitrary Interface 

Let us return to our original scattering problem depicted in Fig. 1. The 
incident plane wave strikes the particle either directly or after interacting 
with the surface. We assume that the direct field and the reflected incident 
field can be expressed as a series of regular spherical vector wave functions 
(SVWF) (/cor) and N)„„^(fcor), i.e. 

oo ni 

E E ^mni [^mni Mi,„^(fcor)-k6,„„iN)„„^(/cor)] (38) 

m—1 m——rii 

and 

OO ni 

EL(r)=E E ^™"i[«m«iM)„ni(fcor) -f 6^„^Ni^„^(/cor)]. (39) 

m — 1 m——rii 
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Here, Dmm is a normalization constant and is given by 



D 



mni 



2m + 1 (ni — |m|)! 
4m(ni + 1) (ni + |m|)! ’ 



( 40 ) 



Explicit expressions for the expansion coefficients Omni and bmm corre- 
sponding to a plane wave incidence in the (0, Pine) direction can be found in 

[14]. 

The scattered field is also expressed in terms of radiating SVWF : 



Esca(r) — ^ ^ ] T^rrm[emnM^„(fcr) -|- f (41) 

n—1 m——n 



and this expansion is valid outside a sphere enclosing the scatterer. 

In addition to the three fields described by (38), (39) and (41), a fourth 
field exists in the ambient medium. This field, which for convenience is called 
the interacting field, is a result of the scattered field reflecting off the surface 
and striking the particle again. The interacting field can be expressed in the 
following form 

oo n 

EL(r) = E E 4?r„nKnM^«(fcr) + /„„N^«(fcr)] (42) 

n—1 m——n 



where M^(((fcr) and N^^(fcr) denote the radiating SVWF reflected by the 
surface. For r inside a sphere enclosed in the particle and a given azimuthal 
mode m the reflected SVWF are given by: 



/M^«(A:r)\ 

V N^n(^r) ) 



OO 

n\—l 



( ^mnni 
T^rmni 






Pmnni 

bm.nn-i 






(43) 



Substituting (43) into (42) we get a representation of the interacting field 
in terms of regular SVWF, i.e. 



C30 ni 

EL(r)=E E Dn,nAelnMnn,{kv) + f^^Nl^^{kv)] 

m = l m——n\ 



where 



fR 
J mni 



oo 



ni—1 



D„ 



( ^mnni 
T^rmni 



^mn 



Pn 



(44) 



(45) 



In the T-matrix method the scattered field coefficients are related to the 
expansion coefficients of the fields striking the particle by the transition ma- 
trix. For an axisymmetric particle the equations become decoupled, permit- 
ting a separate solution for each azimuthal mode. For a fixed azimuthal mode 
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m we truncate the expansions given in (38), (39), (41) and (44) and get the 
following matrix equation for the scattering problem: 



Omn 

fmn 



[Trr 



( ^mni 
^mni 



^mni 

bR 

_ mni 



^mni 

fR 

L mni 



(46) 



Here, m = —M, M and n, ni = 1, iV , where M is the number of azimut- 
hal modes and N is the truncation index. The explicit form of the transition 
matrix [Tmnni] can be found in [12]. The expansion coefficients of the interac- 
ting field are related to the scattered field coefficients by a so called reflection 
matrix 



rrmi 

R 



/, 



where, accordingly to (45) 

[^mnin] — 



— [A„ 



^mn 

fmn 



mn^mnni ^ mn'^mnn\ 
mn/^mnni ^mn^mnni 



(47) 



(48) 



The scattered field coefficients Cmn and fmn are obtained by combining ma- 
trix equations (46) and (47). 

The main problem which must be solved is the calculation of the reflec- 
tion matrix. In order to master this problem it is necessary to compute the 
reflection of a plane wave and a radiating spherical vector wave function by 
the infinite surface. Thus, we have to solve the single scattering problem of 
an infinite surface. Actually, we need not only to compute the reflected fields 
and and but also to expand these fields in terms of regu- 

lar spherical vector wave functions. Essentially, this can be done by using 
the T-operator formalism discussed in Sect. 2. It is noted that application 
of the T-operator formalism requires the representation of incident fields as 
integrals over plane waves. 

In this context, let us write the incident plane wave as 



Einc(r) — -I- Ea 



J exp • r) 



= j [A,„,(K° ) M(r, K° ) + H,„e(K° ) N(r, K°_)] ^ (49) 






where 



A„c(K° ) = (A:oX°) 5(X° - - Ar°“^) 

S„e(K0 ) = (koK^,) 5{Kl - - AfO™-) 

Then, the corresponding reflected field takes on the form 

r dK^dK^ 

EL(r)= / [F„e(K°)M(r,K°) + Gi„e(K^)N(r,KO)] 



(50) 



(51) 
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In (49) and Eo^^^ are the parallel and perpendicular component 

of the electric field, respectively. The spectral amplitudes and 

Gi„c(K° ) are computed according to (26). 

Now, we write the radiating SVWF as integrals over plane waves traveling 
into various directions [12,15] 



/M^Jfcr)\ 

V ^Inikv) ) 

^ /■r/A,(K°) 



M(r,K° ) 






N(r,K°) 



d^dK^y 

koK2 



(52) 



where 



A,(K0) = 2^r^l(/3)e^■™^ 

Bs{K°_) = 2^^iqrTm7rL’”'(/3)e-^' 



(53) 



and K° = (/cq sin/3cos a, /cq sin/3sin a, — /cq cos /3 ) . Representations (52) are 
valid for z < 0 since only then the integrals converge. Hence, application of 
T— operator formalism leads to 



M^^(fcr) 

^^±{kr) 






Gs{Kl) 



M(r,K°) 



G.(K°)' 

F,(K°) 



N(r,K^) 



dK^dK^y 

koK° 



(54) 



where the spectral amplitudes J^s(K° ) and Gs(K'|_) are computed according 
to (26). Analog to the previous case the expansion coefficients in (43) are 
given by (37) and therefore the reflection matrix can be computed according 
to (48) . The solution given above is formal and in practice of limited use since 
we assumed the invertibility of the integral equation (14). As stated before 
equations with a continuous variable often lead to inversion of integral equa- 
tions which is a delicate problem for numerical calculations. The usefulness 
of the formalism in numerical applications will to a large degree depend upon 
the possibility of discretizing (14) in a suitable way. This can be done for a 
class of simple infinite surfaces as for example the plane surface or surfaces 
with periodic roughness and small amplitude roughness. In the following we 
specialize the above formalism to scattering from a plane surface for which 
an exact solution is obtained. 



4.2 Exact Solution for a Plane Interface 

For a plane surface, (14) and (23) will degenerate to just simple algebraic 
expressions and the inversion is trivial. Accounting of (19) and (31) we get 
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the following expression for the reflected incident field: 

Ef„.(r) 

r dK°dK° 

= J [^,„,(K0_)a(K°)M(r,K°) + S,„,(K0_)6(K°)N(r,K°)] 

(55) 



Here K° and K*). are related to each other by Snell’s law, that means 
Ky,±K^) . As expected, denoting by the reflected wave 

vector corresponding to the incident wave vector and by ainc,r and 

Pinc,r the angular spherical coordinates of where ainc,r = oimc and 

/3mc,r = 7t - |3^nc, we And 



Ef„,(r) = 






^ai„c^i(^inc,r)^ain,c,r ) exp (jK' 



Oinc 

+ 



0- 

(56) 



In this context, the coefficients and appearing in (39) are the 
expansion coefficients of a plane wave traveling in the (0, (3inc,r) direction. 

We turn now our attention to the reflection of a spherical multipole in 
the case of a plane interface. The simplified expressions for M^(fcr) and 
N^(fcr) take on the forms 



/^M^(A:r)\ 

V (A:r) ) 

_ /■r/A,(K0_) 



a(K°)M(r,K^) 



U«(K°-) 



6(K°)N(r, 






d^dK^y 
koRO ■ 



(57) 



Denoting the angular coordinates of K'}^ and K'). by (ar,Pr) and {a, (3), 
respectively, where a = ar and [3 = tt — f3r, and transforming the integration 
over the rectangular components of the wave vector into an integration over 
polar angles, we get 



{M^±{kr)\ 

V J 



27T 



0 c_ 









'W 



X exp{jma) exp(jK(j.-r) sin j3d(3da 



r^'(/3) 

{(3) 






(58) 



It is noted that the contour C- corresponds to an integration path in the 
complex plane from 7 t/2 + joo to tt. Clearly, each elementary plane wave in 
the integrand of (58) represents the reflection of the corresponding elemen- 
tary incident wave in the integrand of (52), taking into account the Fresnel 
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reflection coefficients at the interface. Making use of the spherical waves ex- 
pansion (43), we And that the elements of the reflection matrix are given 
by 



— 4j'ni-n 



mVjr'(/3)7r|rb(7r-/3)&(^-/3) 



c_ 



- P)a{TT - j3) 



sin f3dl3 



Pmnni — 






7r|T'(/3)rJr^'(^-/3)&(^-/3) 



C- 



+rJr''(/3)7rj(7^'(7r - /3)(7t - /3) sin pd(3 



‘^mnni — 4j 






T^'(/3)7^|,7l(7^-/3)&(^-/3) 



c_ 



X — Ajni-n 



-|-7r|r'(/3)Ti7^'(^ - /3)a(7T - /3) sin (3df3 

rtKf3)rlTK^ - - P) 



(59) 



-|-TO^7rj(”' (/3)7 t|i™^ ' (tt — f3)a{TT — (3) sin j3dj3. 



4.3 Approximate T-Matrix Method 



An approximate expression for the reflected spherical waves can be found 
if one assumes that the interacting radiation strikes the surface at normal 
incidence. A Fresnel reflection at normal incidence is used to account for the 
reflection loss at the interface. Writing 

a{(3r) = d{f3r) exp(— 2jfcoZo cos j3r) 
b{Pr) = b{Pr) exp{-2jkoZocos (3r) 

assuming 

r(0) = d{Pr) = -b{Pr), 

changing the integration variable from f3 to (3r = tt — (3, and using 



(60) 



(61) 
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we obtain the following simplified integral representations for the reflected 
SVWF: 



V N^^(fcr) ) 




^0R + j 



0 C- 



y mTrjr' {Pr) ^ 



X exp(jma_R) exp(-2jfczo cos Pr) exp(jK‘]_-r) sin P^dPRciaR. 

(63) 



Let us introduce the image coordinate system O x y z by shifting the 
original coordinate system a distance 2zq along the negative z axis. The 
point O is the image of the point O with respect to the surface S. With the 
notation K° • r = K*). • r—2kzo cos Pr, where r = {x ,y , z ),we identify in 
(63) the integral representation over the plane waves of the radiating SVWF 
in the half-space z' > O.Thus, 



/M^«(fcr)\ 
N^«(fcr) ) 






V N^jfcr');- 



(64) 



In this case the interaction held is the image of the scattered held. The 
expansion (43) can be obtained by using the addition theorem for spherical 
vector wave functions. The elements of the reflection matrix, which essenti- 
ally represents the translation coefficients can be computed recursively. Re- 
currence relations for the translation coefficients of vector multipoles field are 
available in the literature [2-4,16]. As a result, the amount of computer time 
to solve the electromagnetic scattering problem is greatly reduced. 

The approximate scattering model consisting in computation of reflected 
spherical vector waves according to (64) is based on the assumption that the 
interacting field is incident on the surface at near-normal incidence. In the 
context of a ray tracing model this assumption is justified by the fact that 
the maximum angle at which a ray emanating from the center of the image 
coordinate system strikes the surface is small (30° for a sphere), and the 
Fresnel coefficients are fairly constant from normal incidence up to this angle 
[2-4]. In view of the above justification we note that the maximum angle 
is large for particles with strongly deformed surfaces, such as oblates, and 
decreases with increasing the distance between the particle and the surface. 
Furthermore, an inspection of (59) reveals that for particles with high size 
parameters the matrix equation includes Legendre polynomials with high 
order. In this case the integrands are highly oscillating functions and the 
assumption that the Fresnel coefficients are constant can lead to erroneous 
result. 

Next, we perform computer simulations in order to investigate the validity 
of the approximate model. Essentially, we compute the normalized differen- 
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tial scattering cross section (DSCS) of polystyrene particles with a refrac- 
tive index of 1.59 deposited on a silicon substrate with a refractive index of 
3. 88-1-0. 02j. Assuming the incident field to have unit amplitude we evaluate 
the normalized DSCS in the azimuthal plane (/J = 0°, which corresponds to 
the plane of incidence. The wavelength of the incident wave is chosen to be 
A = 0.6328 ^m. The excitation light is a P-polarized plane wave having an 
inclination from the normal of 60° . 

Figure 2 shows the normalized DSCS of spherical particles with different 
radii a deposited on the substrate. The curves correspond to the methods 
which use the exact and the approximate reflection matrix. It is obvious that 
the relative errors of the normalized DSCS computed by the approximate 
method increases with the diameter of the particle. 




Fig. 2. Normalized differential scattering cross section (DSCS) of spherical particles 
with different radii a deposited on the substrate. The DSCS is normalized by 
and is evaluated in the azimuthal plane = 0°. The wavelength of the incident 
wave is A = 0.6328 pm, and the excitation light is a P-polarized plane wave having 
an inclination from the normal of 60°. 



In Fig. 3 we consider a spherical particle with a = 0.3 pm situated on 
the surface and at a distance Zq = 0.4 pm above the surface. The plotted 
data show that the approximate method leads to accurate results when the 
distance between the particle and the substrate increases. 

The results plotted in Fig. 4 correspond to an oblate with semiaxes a = 0.1 
pm and 6 = 0.2 pm. Two situations are considered: zq = 0.1 pm and zq = 0.2 
pm. Again, the relative errors of the normalized DSCS are significant when 
the particle is on the substrate and decrease with increasing the distance Zq- 
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Fig. 3. Normalized differential scattering cross section (DSCS) of a spherical par- 
ticle with a = 0.3 /rm sitnated on the surface and at a distance zq = 0.4 /rm above 
the surface. The DSCS is normalized by and is evaluated in the azimuthal plane 
(p = 0°. The wavelength of the incident wave is A = 0.6328 ^m, and the excitation 
light is a P-polarized plane wave having an inclination from the normal of 60°. 




Fig. 4. Normalized differential scattering cross section (DSCS) of an oblate with 
semiaxes a = 0.1 /rm and b = 0.2 fim. Two situations are considered: zo = 0.1 
fim and zo = 0.2 /rm. The DSCS is normalized by 7ra^ and is evaluated in the 
azimuthal plane ip = 0° . The wavelength of the incident wave is A = 0.6328 /rm, 
and the excitation light is a P-polarized plane wave having an inclination from the 
normal of 60°. 
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We conclude this section by noting that the approximate model can leads 
to erroneous results for 1) particles with extreme geometries, when 2) the 
particle is deposited on the surface, or when 3) the main dimension of the 
particle is comparable with the incident wavelength. 
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Abstract. Multiple scattering of an incoherent beam of polarized light propa- 
gating through a random medium is studied using Monte-Carlo simulations. The 
medium comprises a slab of monodispersive spherical Rayleigh particles. Spatial, 
angular and path-length distributions of the scattered light are examined in both 
transmission and backscatter. The change of polarization state is discussed as a 
function of propagation distance for two initial states. Contrasting linear with cir- 
cularly polarized light shows the former to be better preserved in all the results 
obtained. Backscattered light is dominated by single scattering events occurring 
from the medium closest to the exit plane. 



1 Introduction 

Multiple scattering of radiation has been the subject of many investigations 
over the last 50 or so years [l]-[23] being motivated by a desire to understand 
the behaviour of light passing through or scattering from an inhomogeneous 
atmosphere. There have been many theoretical attempts to solve the problem, 
the first of which was a continuum theory where the transport of energy was 
key. This radiative transfer theory [2,3] has proved difficult to solve for all 
but the simplest cases, such as plane wave propagation through plane-parallel 
atmospheres, hence other methods have been devised. These range from the 
‘small angle approximation’, where the direction of a ray after scattering is 
a perturbation of its direction prior to scattering, to diffusion theory, where 
the direction of a ray after scattering is isotropically distributed [4]. More 
recent work on beam propagation includes calculations of the impulse shape 
using the diffusion approximation and show its dependence on the width of 
a Gaussian profile beam [5]. 

The temporal spread of backscattered radiation was examined experi- 
mentally and theoretically using linearly polarized light [6]. It was shown 
that light returning to a detector earliest has its initial polarization state 
largely intact, whilst at later times the light becomes depolarized. In [7] was 
noted sinusoidal azimuthal variations in the co- and cross-polar intensities for 

F. Moreno and F. Gonzalez (Eds.): Lectures 1998, LNP 534, pp. 135-158, 2000. 

© Springer-Verlag Berlin Fleidelberg 2000 




136 K.I. Hopcraft et al. 



when the equation of radiative transfer was solved for a plane wave propa- 
gating at normal incidence to a slab comprising spherical particles. Angular 
lineshape calculations were studied in [8, and references therein] for coherent 
backscatter in the regime of weak localization and found qualitative agree- 
ment with experimental results, and similarly [9] found good agreement with 
experiments for angular lineshapes in both transmission and reflection using 
a much simplified multiple scattering theory. In the topic of diffusing-wave 
spectroscopy [10], the way in which the polarization state is affected by pro- 
pagation through random media and differences between the behaviour of 
linearly and circularly polarized light has been addressed. This was further 
commented upon in [11] in the context of entropy production. 

The purely analytic work has been restricted to the limits of multiple 
scattering where the number of scattering events is very low or very high. 
The entire range of light scattering can however, be explored with numerical 
simulations. One class of computer simulations is known as the ‘Monte-Carlo’ 
method [26,27] which is a stochastic approach to problem solving. This is 
particularly useful in the study of incoherent light scattering as it simulates 
the flow of energy through a medium. 

This paper provides results from a classical Monte-Carlo simulation of 
vector light scattering for the purpose of investigating the behaviour of pola- 
rized light as it propagates through a medium. The adjective ‘classical’ refers 
to the simulation being a one-to-one analogue of the physical process. Non- 
classical simulations are ones in which variance reduction techniques have 
been introduced to accelerate the statistical convergence of results [15,19]. 
Monte-Carlo simulations have been used many times in the past for unpo- 
larized light [12-15], but fewer workers have analysed polarized light, see 
however [16-23]. Most of these simulations have been written for specific ap- 
plications from lidar evaluations, through visibility studies, to astrophysical 
simulations. This paper sets out to rigorously examine the polarization pro- 
perties of multiply scattered light in its entirety. Whilst a few aspects have 
been touched upon by the aforementioned work, those simulations were tied 
to particular configurations of detection such as small held of view and mea- 
suring exclusively in transmission or backscatter using linearly or circularly 
polarized light. This paper examines the cases of linear and circular initial 
polarization states extensively with all permutations of output variables (in- 
cluding exit positions, exit angles and path lengths) and their influence on 
final polarization states. The differences in behaviour of linear and circular 
polarization reported in [10] have, in part, prompted this study to examine 
the possibility of exploiting the use of polarization information as a discrimi- 
nant for imaging and detection in scattering media. 

There are two ways to incorporate polarization information into a Monte- 
Carlo simulation: by using Stokes vectors with the full scattering matrix 
[16,17,19-23] or using a simple helicity state [18]. Here the former method is 
used as the latter treats circularly polarized light alone. It should be stres- 
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sed that, with notable exceptions [16,21,23], few previous simulations have 
implemented polarization effects rigorously. The simulation described here 
explores some of the more fundamental aspects of the vector multiple scat- 
tering problem. 

The spherical scatterers are assumed to be small enough for the Rayleigh 
approximation to apply [24]. Among the past simulations, other scattering 
models for the medium, such as an ensemble of Mie scatterers [24] (descri- 
bing exactly spherical particles of arbitrary size) and the heuristic Henyey- 
Greenstein model [25] have been used, see [16,17,20,21,23] and [15,18,19,22] 
for respective examples of such usage. 

The results presented have a variety of applications, including: studying 
the transfer of solar radiation through clouds [12]; simulating backscatte- 
ring/transmission of laser light from/through clouds [13,17]; studying visi- 
bility in foggy conditions [14]; examining image formation through turbid 
media [15]; establishing whether polarized light can discriminate short-path 
photons [18]; studying polarized pulse propagation through turbid media [19]; 
determining the influence of particle size on the depolarization of light [20] ; 
simulating polarization maps of light emissions from protostellar dust enve- 
lopes [21,22]; and to simulate light emissions from dusty spiral galaxies [23]. 

In the next section, the structure of the Monte-Carlo code is described 
together with an explanation for how the direction of a light ray and its po- 
larization state changes upon scattering from a particle. Section 3 defines the 
quantities measured from the simulation data, such as the degree of polariza- 
tion and shows the results of such measurements. Further diagnostics derived 
from the data are obtained in Sect. 4, which shows the dependence of the 
degree of polarization and other quantities on exit position, exit angles and 
propagation path- length. Section 5 provides a summary and conclusion. The 
appendix gives details for how the polarization state relates to the scattering 
geometry and derives the phase function for the scattering angles. 

2 Description of Model 

This section outlines a Monte-Carlo simulation for describing the characteri- 
stics of polarized radiation that exactly models the scattering events caused 
by small (sub-wavelength sized) spherical particles. It predicts the final desti- 
nation of multiply scattered rays by launching a beam from the origin that 
propagates parallel to the z-axis in the first instance, as depicted in Fig. 1. 

The instantaneous polarization state of each ray is carried along its jour- 
ney by the simulation. The scattering medium occupies the semi-infinite half- 
space z > 0 and comprises an ensemble of Rayleigh scatterers distributed 
randomly throughout the medium. 

The Monte-Carlo simulation requires three elements to model the scatte- 
ring. These determine: 

• when and where an individual scattering event occurs 
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boundary surface 

plane 



Fig. 1. An illustration of the simulation geometry 



• how the interaction with a particle alters both the trajectory and subse- 
quent polarization state of a ray 

• how the local coordinate system carried by a ray changes after a scattering 
event. 

In addressing the first element it is assumed that scattering medium is 
isotropic and homogeneous, hence the probability density for the distance s 
travelled by a ray before it encounters a particle is determined by a negative 
exponential distribution with a given mean free path characterizing the size 
(or length scale) of the medium. Altering the size of the mean free path 
would make the medium more, or less dense but is here set to unity to define 
a scale length. A given ray propagates a random distance s drawn from the 
distribution before encountering a particle. The particle then scatters the ray, 
changing its direction of propagation and polarization state. The state of the 
ray is described in terms of the Stokes vector [2,24], 

S = (I,Q,U,V). (1) 

The first component / represents the intensity of the ray. In the calculati- 
ons presented this component is assigned a value of unity, i.e. as if scattering 
from a particle and the propagation between particles are both lossless. Q is 
the difference between the intensities of two orthogonal polarization compo- 
nents resolved at angles of 0 and tt/2 relative to a reference axis in the plane 
perpendicular to the ray direction. [/ is a similar measure with polarization 
components resolved at 7t/4 and 37t/4 in the same plane. V is the difference 
in intensity between right and left circularly polarized components. 

The second element of the simulation concerns the trajectory and pola- 
rization state of a ray subsequent to a scattering event. This requires kno- 
wledge of the scattering amplitudes of the particles comprising the medium. 
For Rayleigh scatterers the angular part of the intensity distribution of the 
scattered light, usually referred to as the ‘phase function’, adopts the simple 
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z 




Fig. 2. The scattering geometry of the ray directions 



analytical form 

P{fi, <P, S) = [(/t^ + l) + “ l) {Q cos 2<P + U sin 2^)] , (2) 

where fx = cos 6*, 0 is the scattering angle between the incoming and out- 
going ray directions and is the azimuthal angle through which the ray is 
rotated (see Fig. 2). This phase function acts as a joint probability density 
function for determining the direction in which the ray propagates following 
its encounter with the particle and depends upon the polarization state of 
the impinging ray. In Fig. 3 the polar surface plots show the phase func- 
tion for linear and circular polarized light. The linear phase function is not 
axisymmetric about the incident light direction due to the <P dependence of 
(2). The circular phase function is axisymmetric because the incoming light 
has no Q or 17 components. The linear phase function is broader in the y 
direction and has a null point in the x direction. It is also important to note 
the forward/backward symmetry of the scattered intensity arising because of 
the small particle size. 

The scattered intensities in particular polarization states are calculated in 
the appendix and the results are depicted in Figs. 4 and 5. The co-polarized 
component shown on the left of Fig. 4 has larger magnitude than the cross- 
polar component shown on the right. The cross-polar component has several 
null points, two of which are in the direct forward and back-scattered di- 
rections. This is also the case for circularly polarized light as shown in Fig. 
5, the right-forward component is much larger than left-forward component. 
The mirror symmetry between right and left components stems from the con- 
vention of using helicity as a reference. The backward left component rotates 
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Fig. 3. Polar plots of the phase function for scattering from Rayleigh particles. The 
left figure shows the linearly polarized case and the right is for circularly polarized 
light. The incident light propagates in the jjositive z direction and scatters from a 
particle situated at the origin. The linearly polarized light is initially polarized in 
the X direction. The large arrows show the incident direction and the small arrows 
indicate the initial polarization stale 



in the same sense as the forward riglit component but their propagation direc- 
tions are opposite. Tlie fraction of light scattered into tlie other polarization 
state can be calculated from (16) to (21) in the appendix. For linearly po- 
larized light, this fraction is 1/16 and for circular polarization it is 1/8. So 
from each scattering it is apparent that circularly polarized light is twice as 
likely to be scattered into its opposite state as linearly polarized light. 

The scatter angles for determining the direction of a ray are generated 
using the inversion and rejection methods on the marginal and conditional 
probabilities derived from the pha.se function |27,28]. Of the published pola- 
rization simulations mentioned in the introduction, only two use non-uniform 
distributions for <P [21,23|. The other simulations [17-20,22] use the uniform 
distribution for <P which is a questionable simplification due to the depen- 
dence of the phase function on the incoming polarization state and <P in (2) 
and the form that these take as shown in Fig. 3. 

The new Stokes vector S' that results from the scattering interaction is 
easily determined through the .Mueller matrix M which, in this instance [24] 
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Fig. 4. Polar plots of single-scattering intensities for linear input. The left figure 
shows the co-polarized component and the right is for cross-polarized light 




Fig. 5. Polar plots of single-scattering intensities for circular input. The left fi- 
gure shows the right-circularly polarized component and the right figure is for left- 
circularly polarized light 
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is a simple function of the scattering angle O 



M(m) 
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2 
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\ 0 0 0 2/r/ 



(3) 



The final element of the simulation determines the local coordinate trans- 
formation that is required following the scattering event. This transformation 
defines the plane in which the polarization vector lies along with the direc- 
tion of the reference axis. The new Stokes vector S' is obtained from the old 
Stokes vector S using 



S' = T(-a)M(^)T(<l>)S 
where T is the rotation matrix 

/l 0 0 0\ 

, , _ 0 cos2q; sin2of 0 

0 — sin 2a cos 2a 0 ’ 

\0 0 01 / 



( 4 ) 



( 5 ) 



and a is the angle labelled in Fig. 2. 

There is a finite probability that a few rays may meander in a lateral 
direction within the slab and consequently may travel a long distance be- 
fore emerging from the exit plane. Introducing a hemispherical boundary of 
predetermined size (taken to have a radius of sixty mean free paths in the si- 
mulations that follow) limits the extent of the scattering domain and also the 
high computational overhead that entails following such errant rays. Hence 
any ray that traverses this boundary is effectively absorbed. 

Figure 6 shows the logical steps for the simulation. Each ray carries with 
it a set of variables describing its state, these are: 



• the position of the ray head r 

• the direction of the ray d 

• the polarization state S 

• the total distance travelled by the ray St 

• the maximum depth penetrated by the ray into the medium Zmax 

• the number of scattering events experienced by the ray fVscat- 

The four main stages to the simulation [26,3,1] are: 

1. a specific initial polarization state is assigned which is then propagated 
to its first randomly situated scattering site 

2. the scattering angles are generated using the phase function (2) and the 
new polarization state is obtained using (4) 
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Repeat 
N times 




Fig. 6. Flow of Monte-Carlo simulation 



3. tests are performed to ascertain whether a ray has passed any relevant 
surfaces, such as the exit plane, cut-off hemisphere or some other plane 
used for diagnostic purposes 

4. upon traversing a surface, the state of the ray (as listed in the description 
above) is calculated for the point of interception with the surface. If the 
surface is the exit plane, the ray is recorded. If the surface is the cut-off 
hemisphere, the ray is abandoned. In all other cases the ray’s state is 
recorded before it continues its journey by returning to stage 2. 



Once the ray has left the scattering domain or has been abandoned, the 
process is repeated a further — 1 times, where N is sufficiently large to 
ensure reliable statistics of the derived state variables. Further details of the 
scattering geometry and the way that the local coordinate system transforms 
are contained in the appendix. Pseudo-random numbers are generated using 
the routine rcui2 from [29]. This was selected because it has a very large 
period of 2 x 10^® guaranteeing that the same sequence of random numbers 
is not used during a simulation of ~ 10® rays. 
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3 Simulation Results and Discussion 



To generate data to study the behaviour of the polarization state of multiply 
scattered light, the simulation program described in the previous section was 
run for 2 million rays. All length measures are in units of mean free paths. 
Two initial polarization states are studied: linear with S = (1,1, 0,0) and 
right circular with S = (1,0,0,1). 

During the simulated propagation of each ray the first crossing of each 
of thirty internal boundaries is logged and the response of a virtual detector 
positioned at each of these planes can be calculated. The response of a par- 
ticular type of detection system is found for each ray by forming the scalar 
product of the ray Stokes vector with Si describing the detection system. 
For a detector with no polarization discrimination Si = (1, 0, 0, 0), a detector 
with a linear polarizer in the same orientation as the linearly polarized illu- 
mination (co-polar) has Si = (1, l,0,0)/2, a detector with a linear polarizer 
in the orthogonal orientation (cross-polar) has Si = (1, — 1, 0, 0)/2, a detec- 
tor with a circular polarizer in the same orientation as the right-circularly 
polarized illumination has Si = (1,0, 0, l)/2 and for a detector with a left 
circular polarizer has Si = (1,0,0, — 1)/2. 

The degree of polarization (dop) D quantifies the change in polarization 
of the scattered rays is defined by 



D, = 



I ^CO 



for linear polarization and by. 



Dr = 



bright 

bright 



.fleft I 



(6) 

(7) 



for circular polarization. 

In all plots presented the left vertical scale relates to the intensity and the 
right vertical scale to the degree of polarization of a particular state. Figure 7 
shows the intensity and degrees of polarization that would be detected in 
transmission through slabs of varying thickness by a spatially-integrating 
detector. 

The transmitted intensity decays from an initial value of unity to 0.053 at 
a depth z = 30. It decreases in a sub-exponential manner as the logarithmic 
plot in Fig. 8 shows. In fact the intensity has an asymptotic power law de- 
pendence with exponent —0.955. The unscattered remainder is governed by 
an exponential decay. The linear and circular values of D both decay much 
quicker than the transmitted intensity and beyond z = 6 they are less than 
0.1. The ratios for linear states always lie above those for the circular states. 
The jaggedness of the lines in Fig. 8 shows the effect of noise caused by stati- 
stical ffuctations in our finite sampling. This noise ultimately limits accuracy. 
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Fig. 7 . The transmitted intensity and degree of polarization as a function of slab 
thickness for linear and circular input polarizations. The left hand vertical scale 
corresponds to the intensity curve (valid for both linear and circular illumination) 
and the vertical scale on the right is for linear and circular degrees of polarization. 
The slab thickness is in units of mean free paths 
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Fig. 8. A log plot of transmitted intensity and degree of polarization against slab 
thickness for linear and circular input polarizations 



Any initial polarization state is scrambled upon the light having reached 
a depth of about 6 mean free paths. This is equivalent to a ray having ex- 
perienced (on average) just over 24 scatterings, which is in accord with the 
figure quoted in [6]. 
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The propagation characteristics of linear polarized light is a little more 
resilient to depolarization than is circular light. This is in agreement with 
earlier experimental [10], theoretical [11] and simulation [20] work. The dif- 
ference in polarization ‘memory’ was first discussed in [10] and has been 
attributed to the helicity inversion that circularly polarized light suffers due 
to the scatterer acting as an optical mirror [10,11,20]. This inversion does not 
occur for linear polarization so that state is better preserved. A similar line 
of reasoning leads to the opposite effect occurring for particles whose size is 
greater than the wavelength of the light. The circular state has greater immu- 
nity to depolarization on scattering than does the linear state. This happens 
since these particles scatter more in the forward direction than Rayleigh par- 
ticles and so there is less helicity inversion. However the calculations of the 
previous section indicate that for small particles the difference in behaviour 
stems from fundamental differences at the single scattering events where the 
initial polarization state is twice as likely to be changed for circularly polari- 
zed light. 

The third graph, Fig. 9, shows rays that are backscattered from media of 
various thicknesses. This is derived from the data by removing contributions 
from rays that have travelled past a hypothetical absorbent plane located at 
various depths. The backscattered intensity increases as the medium becomes 




Fig. 9. A plot of backscattered intensity and degree of polarization against slab 
thickness for linear and circular input polarizations 



thicker because rays have a greater chance to scatter back to the input plane. 
It should increase to unity for an infinitely thick slab, but our hemispherical 
cut-off surface limits the maximum value to about 0.975 since approximately 
2.5% of rays are abandoned after breaching the hemisphere. The linear degree 
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of polarization decreases from unity to 0.31 whilst the circular ratio declines 
further to 0.22. This shows that the backscattered light retains some of the 
original polarization, with the linear input better preserved for the same 
reasons as those detailed above. 

4 Spatial, Angular, and Path-Length Histograms 

Whenever a ray intercepts either the exit or an internal reference plane, its 
state is recorded as described in Sect. 2. In this section, the dependences 
of the intensity and the degree of polarization with position, direction and 
the length of path traversed are examined. The distributions of transmitted 
and backscattered intensities together with those for the degree of polariza- 
tion are shown as a function of the radial distance from the original axis of 
propagation to show the degree of beam spreading, and with polar angle to 
indicate the direction of the ray. Also shown is the azimuthal dependence of 
these quantities about this axis and the distribution of path-lengths. 




Fig. 10. Transmitted intensity per unit area as a function of radius from the 2 -axis 
at depths of 2 = 1, 2, 4, 8. 



In Figs. 10 & II the point of intersection that each ray makes with a 
reference plane is binned as a function of the radial distance p from the 2 - 
axis. The bin size is 0.25 mean free paths wide and the intensity has been 
scaled by the reciprocal of the midpoint radius of each bin to counter the 
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bias due to differing bin areas that occurs in a cylindrical-polar coordinate 
system. In the forward direction, only scattered rays are included in the 
ensemble. Unscattered rays propagate along the z-axis and therefore have a 
delta-function distribution at p = 0 which is not shown. 

From Fig. 10 it can be seen the intensities decrease in an approximately 
exponential manner for sufficiently large values of p, hence the beam broadens 
as it propagates forward. Near to the axis there is a predominance of single 
scattered light which has retained its polarization state and so the degree 
of polarization is close to unity. This feature is maintained with increasing 
depth into the medium. The linear degree of polarization is greater for all 
radial positions and depths than the circular dop. This is consistent with the 
spatial form of the single scattering intensity distributions shown in figures 
4 and 5. The graphs for backscattered rays in Fig. 11 are constructed for 




Fig. 11. Backscattered intensity per nnit area as a function of radius p from the 
a-axis. An absorbing screen is positioned at distances 2 = 1,2, 4, 8 



hypothetical absorbing screens positioned at distances z = 1, 2, 4, 8 mean free 
paths, thus simulating the effect of slabs having different thicknesses. These 
show little qualitative difference in the behaviour of the dops from those 
shown for the transmitted rays that reach a depth of one mean free path, 
thus demonstrating the fore-aft symmetry of Rayleigh scatterers. The results 
are insensitive to the slab thickness, showing that the backscattered results 
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are dominated by single scattering events from the immediate surrounding 
medium. 

The polar angle distribution of the exit rays is plotted in Figs. 12 & 
13. Here, 0 = 0° corresponds to the forward propagating direction and 6 = 
90° propagation perpendicular to the z-axis. The bin widths are 5° and the 
intensity is normalized by the sine of the angle between the z-axis and each 
bin’s midpoint. 




Fig. 12. Plots of transmitted intensity per unit solid angle against polar angle at 
depths of z = 1, 2, 4, 8. Only the contributions from scattered rays are shown. 



Again, only the scattered rays are considered. The unscattered rays have 
a delta- function distribution in the forward scattering direction 0 = 0°. In 
Figs. 12 & 13 the dop curves have similar behaviour to each other, though 
again the linear polarization is greater than circular polarization. By 8 mean 
free paths into the medium the polarization is no longer angle dependent and 
all memory of the initial polarization state has been erased. 

The distribution of the rays with azimuthal angle is shown in Figs. 14 & 
15. The bins are 10° wide. As before, only scattered rays are shown. In the 
forward direction at z = 1 the intensity curve for linear polarization varies 
sinusoidally with angle (j). This due to the variation of the linearly polarized 
intensity as depicted in Fig. 4. This strong variation with (j) is smoothed out 
with increasing propagation into the medium. Note also that the circular 
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Fig. 13. Plots of backscattered intensity per unit solid angle as a function of polar 
angle for slab thicknesses 2 : = 1, 2, 4, 8 



polarization state is initially isotropic with (f> and hence remains so with 
increasing propagation distance. 

The distributions for backscattered rays are shown in Fig. 15 for progres- 
sively thicker slabs. These exhibit a similar asymmetry in azimuthal distri- 
bution of the linear polarization state for the same reasons as given before. 
However the distributions are not smoothed out in this instance because of 
the dominant effect of single scattering events in the backscattered direc- 
tion. The general behaviour of the intensity distributions with (j) is similar 
to that found in [7] where the radiative transfer equation was solved nume- 
rically. This provides some credence for using continuum models to describe 
scattering behaviour from ensembles of discrete entities. 

Finally the logarithm of the distribution of the intensity as a function 
of the total path-length st traversed by rays before they exit the medium 
is shown in Figs. 16 & 17. The bin width is 0.5 mean free path. Note in 
Fig. 16 that the decay of the intensity is exponential for depths up to 2 = 4, 
beyond which a second maximum appears away from st = 2 . The second 
maxima indicate that the rays follow a most likely path length. This may be 
interpreted as the path length for which multiple scattering is most likely to 
occur since the inter-scatter propagation rule for single scattering no longer 
applies. The dop curves confirm this view as they indicate that most of the 
initial polarization information has been eradicated at large depths into the 
medium. The tails of these curves also have an exponential distribution. Note 
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Fig. 14. Plots of transmitted intensity per unit solid angle against azimuthal angle 
at depths of 2 = 1, 2, 4, 8. Only the contributions from scattered rays are shown. 



that the proportion of ballistic rays (st = z) decline with increasing depth 
into the medium, to the extent that there are almost none at a depth of 2 = 8. 

The logarithm of the backscattered intensity as a function of path-length 
is shown in Fig. 17. For thin slabs, the distribution has an exponential tail. For 
thicker slabs (z > 4) the backscattered intensity follows a power law at long 
path lengths with exponent —1.117. The dop curves decay away for longer 
path-lengths which shows that the rays that travel further are (obviously) 
scattered more times. 

5 Summary and Conclusions 

This paper has examined the way in which a beam of radiation has its state of 
polarization altered upon propagation through a random medium comprising 
an ensemble of spherical Rayleigh particles. The scattering phase functions 
particular to these particles are treated exactly and multiple scattering effects 
are modelled by tracing the destination and polarization state of a bundle of 
rays using a Monte-Carlo simulation. The scattering medium has a slab geo- 
metry and its optical density can be changed by altering the mean free path 
of a ray between successive particle encounters. Various measures can be con- 
structed from the simulation data that may be replicated under experimental 
conditions. This aspect of the present work has not been comprehensively ad- 
dressed in previous treatments of polarization effects in multiple scattering 








152 K.I. Hopcraft et al. 







Fig. 15. Plots of backscattered intensity per unit solid angle against azimuthal 
angle at screen positions of « = 1, 2, 4, 8 



situations, where the phase functions are taken to be heuristic, or the dia- 
gnostics have not been formulated in terms of experimental measurables. 

The chief result is that a linear polarization state is better preserved than 
a circular state whatever gauge of measurement is used. This is seen both in 
transmission through the slab and in backscatter and is quantified in terms 
of the dimensionless ‘degree of polarization’, being a ratio of the absolute 
difference between orthogonal polarization states and their sum. This robust 
result stems from the form that the scattering phase functions adopt for 
different polarization states, as depicted in Fig. 3. For linear states, the phase 
function is greatest in the plane containing the direction of propagation of 
the ray and normal to the polarization vector, and possesses a significant cleft 
in the plane orthogonal to this. By comparison, the circular case has a more 
isotropic distribution, with the cleft being effectively filled in. 

Unsurprisingly, multiple scattering causes the initially collimated beam to 
spread with depth into the medium and also in the backscattered direction. 
Close to the axis the degree of polarization is greatest because most of the 
light is either unscattered or has been scattered only once. Figure 11 shows 
the spread of backscattered light for slabs of increasing thickness, or equi- 
valently, decreasing mean free path between scatterers. The curves depicting 
the degree of polarization are essentially independent of the depth of the 
medium, indicating that single scattering effects are dominant in backscat- 
ter. Allied to the spreading of the beam is the direction in which the rays 
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Fig. 16. The logarithm of transmitted intensity as a function of path-length traver- 
sed by a ray at reference depths of z = 1,2, 4, 6, 8. Each horizontal scale has been 
offset by the depth of each measurement plane. The last graph has a bin width of 
1 mean free path and furthermore its intensity has divided by two to ensure the 
same vertical scaling 



are propagating when detected and this aspect will clearly impinge upon 
imaging considerations. The degree of polarization is weakly dependent on 
the polar angle for 6 < 40°, denoting the direction of the ray (as shown in 
Figs. 12 & 13) but the linear state is preserved better than the circular, a 
result that once again can be attributed to the form of the phase function 
(2) and the single scattering intensities (16-21). The invariance of the cur- 
ves for backscattered light with increasing slab thickness (Fig. 13) reinforces 
the view that backscatter is dominated by single scattering effects from the 
medium. In transmission the direction of the rays are effectively randomized 
after propagating through eight mean free paths into the medium. The degree 









154 K.I. Hopcraft et al. 





xls 



xls 



Fig. 17. Logarithm of the backscattered intensity as a function of path-length for 
slabs of thickness 2 = 1, 2, 4, 8 



of polarization in either state is shown as a function of azimuthal angle in 
Figs. 14 & 15 and these illustrate the isotropy /anisotropy between circular 
and linear polarization which ties in with the results of [7]. Once again the 
linear state is essentially randomized on propagating about eight mean free 
paths into the medium. 

A useful diagnostic of the model is the path length taken by each ray. This 
can be interpreted as the time of flight taken by a photon. Figures 16 & 17 
show that for a given slab thickness there is a most likely path length for an 
ensemble of rays to follow. The probability for the occurrence of unscattered 
‘ballistic’ rays, which are observed for thin slabs, naturally decreases with 
increasing slab thickness. 
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A Appendix: Scattering Geometry and Derivation of 
the Phase Function 

This appendix gives further details of the scattering geometry and represen- 
tation of the polarization state used in the simulations. A ray having initial 
direction d and polarization S is scattered by angle O and rotated by to a 
new direction d' and new polarization S'. Figure 2 illustrates the ray direc- 
tions where the arcs are parts of great circles on a unit sphere so the spherical 
polar coordinates of d, angles (0, </>) are arc lengths zP and xQ. For d' (0', (/>') 
are the lengths zP' and xQ'. These angles are related by 

d • d' = cos O (8) 

and 

d' = R,(<^)R^(0)R,(<?)R^(e)R^(-0)R,(-</>)d (9) 

where Rj, and R^. are rotation matrices about the y and z axes respectively. 

Linearly polarized light can be resolved into two orthogonal components 
in a plane perpendicular to its direction of propagation. To do so requires a 
reference polarization axis relative to which these components can be mea- 
sured. A local coordinate frame is attached to each ray which is transformed 
whenever the ray changes direction. The reference axis is defined by the local 
9 direction in the spherical polar representation of the ray’s direction vector 
d. This axis is also the line of intersection between the plane at P that is 
perpendicular to d and the plane containing both the z-axis and d. In Fig. 2 
the reference axis is the tangent to the great circle that passes through Pz. 
However if a ray travels in the z direction, the x-axis is used as the reference 
axis. 

When a redefinition of the polarization reference axis takes place the 
components of the Stokes vector must also transform as 

S T(a)S (10) 

where a is the angle of rotation with respect to the ray direction and T is 
given by (5). Note that the first and last components of S are invariant to 
rotations. 

The Mueller matrix M describes how an optical device affects light re- 
presented by a Stokes vector: 



S-^MS. (11) 

So for a scattering event M would describe the way the polarization trans- 
forms. 

To keep the adopted polarization convention intact requires pre-rotating 
the reference axis, which lies along the arc zP, to the scattering plane defined 
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by POP^ After scattering, a post-rotation is applied so that the reference axis 
lies on the arc zP' , in keeping with the stated convention. Hence the Stokes 
vector must be transformed accordingly: 

S' = T(-a)M(6), <!>, 0, (l))T{<P)S (12) 



where for spherical particles M is given by (3) and depends on 0. From 
figure 2, the angle a is given by 



cos a = 



cos 0 — cos 0' cos 0 
sin 0' sin 0 



(13) 



The domain of a depends on the value of in the following manner: 0 < 
a < 7T for 0 < ^ < 7T and — tt <a<0for7r<<?< 27 t. It is also important 
to note that there are several special cases that arise from using a spherical 
polar coordinate system. In these cases, (9), (12) & (13) can be singular and 
the transformed quantities must be calculated in an alternative manner. This 
particular detail is crucial to the implementation of the simulation as errors 
(such as dividing by zero) can otherwise occur. These singularities are due to 
the choice of coordinate system. 

The phase function follows from (12). Generally the scattered intensity 
can be written in the form 



/'(d')ocP(0,^,S,d)/(d). (14) 

The phase function is normalized according to the relation 

/ dl7 P(6),<l>,S,d) = 1 VSandd, (15) 

J all angles 

which determines the constant of proportionality in (14). For Rayleigh scat- 
terers the phase function (2) does not depend on d because the scattering 
particle is assumed to be a sphere. 

The state of a singly scattered ray is easily deduced from (4) . When the 
light is linearly polarized, the total scattered intensity is 

I = (1/2) [/i^ -k 1 -k - 1) cos 2<P] , (16) 

the co-polar intensity is 

/eo = (1/4) M + 1 + (ImI - 1) COS2<l>]" (17) 

and cross-polar intensity is 

/cross = (1/4) (|M|-l)"sin2 2<Z>. (18) 

For incident light that is circularly polarized the total intensity is 

/ = ( 1 / 2 ) + 1 ) , 



(19) 
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the right-circularly polarized intensity is 

/right = (1/4) (m+1)" (20) 

and left circular intensity is 

/left = (1/4) (M-l)^ (21) 

The phase functions are plotted in Fig. 3, the intensities in each polarization 
are shown in Figs. 4 and 5. 



References 

1. S. Karp, R.M. Gagliardi, S.E. Moran & L.B. Stotts (1988) Optical Channels, 
Plenum Press, NY 

2. S. Chandrasekhar (1960) Radiative Transfer, Dover Publications, NY 

3. J.J. Dnderstadt & W.R. Martin (1979) Transport Theory, John Wiley & Sons, 
NY 

4. A. Ishimaru (1978) Wave Propagation and Scattering in Random Media Vol. 
1 & 2, Academic Press, NY 

5. S. Ito (1981) “Theory of beam light pulse propagation through thick clouds: ef- 
fects of beamwidth and scatterers behind the light source on pulse broadening,” 
Appl. Opt. 20, 2706-15 

6. K.M. Yoo & R.R. Alfano (1989) “Time resolved depolarization of multiple 
backscattered light from random media,” Phys. Lett. A 142, 531-5 

7. R.L-T. Cheung & A. Ishimaru (1982) “Transmission, backscattering, and de- 
polarization of waves in randomly distributed spherical particle,” Appl. Opt. 
21, 3792-3798 

8. M.J. Stephen & G. Cwilich (1986) “Rayleigh scattering and weak localization: 
Effects of polarization,” Phys. Rev. B 34, 7564-72 

9. D. Eliyahu, M. Rosenbluh & I. Freund (1993) “Angular intensity and polariza- 
tion dependence of diffuse transmission through random media,” J. Opt. Soc. 
Am. A 10, 477-91 

10. F.C. MacKintosh & S. John (1989) “Diffusing-wave spectroscopy and multiple 
scattering of light in correlated random media,” Phys. Rev. B 40, 2383-406 ; 
K.J. Peters (1992) “ Coherent-backscatter effect: A vector formulation accoun- 
ting for polarization and absorption effects and small or large scatterers,” Phys. 
Rev. B 46, 801-12 ; F.C. MacKintosh, J.X. Zhu, D.J. Pine & D.A. Weitz (1989) 
“Polarization memory of multiply scattered light,” Phys. Rev. B 40, 9342-5 

11. D. Bicout & C. Brosseau (1992) “Multiply scattered waves through a spatially 
random medium: entropy production and depolarization,” J. Phys. I France 2, 
2047-63 

12. G.N. Plass & G.W. Kattawar (1992) “Monte Carlo calculations of light scatte- 
ring from clouds,” Appl. Opt. 7, 415-9 

13. G.N. Plass & G.W. Kattawar (1973) “Reflection of light pulses from clouds,” 
Appl. Opt. 10, 2304-10 (1971); E.A. Bucher, “Computer simulation of light 
pulse propagation for communication through thick clouds,” Appl. Opt. 12, 
2391-400 




158 K.I. Hopcraft et al. 



14. B. Maheu, J.-P. Briton & G. Gouesbet (1989) “Four-flux model and a Monte 
Carlo code: comparisons between two simple, complementary tools for multiple 
scattering calculations,” Appl. Opt. 28, 22-4 ; J.-P. Briton, B. Maheu, G. 
Grehan & G. Gouesbet (1992) “Monte Carlo simulation of multiple scattering 
in arbitrary 3-D geometry,” Part. Part. Syst. Charact. 9, 52-8 

15. P. Bruscaglioni, P. Donelli, A. Ismaelli & G. Zaccanti (1991) “A numerical 
procedure for calculating the effect of a turbid medium on the MTF of an optical 
system,” J. Mod. Optics 38, 129-42 ; P. Donelli, P. Bruscaglioni, A. Ismaelli 
& G. Zaccanti (1991) “Experimental validation of a Monte Carlo procedure for 
the evaluation of the effect of a turbid medium on the point spread function of 
an optical system,” J. Mod. Optics 38, 2189-201 

16. G.W. Kattawar & G.N. Plass (1968) “Radiance and polarization of multiple 
scattered light from haze and clouds,” Appl. Opt. 7 , 1519-27 ; G.W. Kattawar 
& G.N. Plass (1972) “Degree and direction of polarization of multiple scattered 
light. 1: Homogeneous cloud layers,” Appl. Opt. 11, 2851-65 

17. T. Aruga & T. Igarashi (1981) “Narrow beam light transfer in small particles: 
image blurring and depolarization,” Appl. Opt. 20, 2698-705; corrected in T. 
Aruga & T. Igarashi (1981) Appl. Opt. 20, 3831 

18. J.M. Schmitt, A.H. Gandjbakhche & R.F. Bonner (1992) “Use of polarized 
light to discriminate short-path photons in a mulitply scattering medium,” 
Appl. Opt. 31, 6535-46 

19. P. Bruscaglioni, G. Zaccanti & Q. Wei (1993) “Transmission of a pulsed polari- 
zed light beam through thick turbid media: numerical results,” Appl. Opt. 32, 
6142-50 

20. D. Bicout, C. Brosseau, A.S. Martinez & J.M. Schmitt (1994) “Depolarization 
of multiply scattered waves by spherical diffusers: Influence of the size parame- 
ter,” Phys. Rev. E 49, 1767-70 ; A.S. Martinez & R. Maynard (1994) “Faraday 
effect and multiple scattering of light,” Phys. Rev. B 50, 3714-32 

21. O. Fischer, Th. Henning & H.W. Yorke (1994) “Simulation of polarization 
maps. I. Protostellar envelopes,” Astronomy and Astrophysics 284, 187-209 

22. W.J. Henney & D.J. Axon (1995) “Polarization profiles of scattered emission 
lines. HI. Effects of multiple scattering and non-Rayleigh phase functions,” 
Astrophys. J. 454, 233-53 

23. S. Bianchi, A. Ferrara & G. Giovanardi (1996) “Monte Garlo simulations of 
dusty spiral galaxies: Extinction and polarization properties,” Astrophys. J. 
465, 127-44 

24. H.G. van de Hulst (1981) Light Scattering by Small Particles, Dover Publica- 
tions, NY 

25. L.G. Henyey & J.L. Greenstein (1941) “Diffuse radiation in the galaxy,” Astro- 
phys. J. 93, 70-83 

26. E.D. Cashwell & G.J. Everett (1959) Monte Carlo Method for random walk 
problems, Pergamon Press, NY 

27. P. Bratley, B.L. Fox & L.E. Schrage (1983) A Guide to Simulation, Springer- 
Verlag, NY 

28. B. Jansson (1966) Random Number Generators, Victor Pettersons, Stockholm 

29. W.H. Press, S.A. Teukolsky, W.T. Vetterling & B.P. Flannery (1992) Numerical 
Recipes in C, 2nd Ed. CUP 

30. A.P. Bates, K.I. Hopcraft & E. Jakeman (1997) “Particle shape determination 
from polarization fluctuations of scattered radiation,” J. Opt. Soc. Am. A 14, 
3372-8 




9 Polarization and Depolarization of Light 



M.I. Mishchenko and L.D. Travis 

NASA Goddard Institute for Space Studies, 

2880 Brodway, New York, New York 10025, U.S.A. 



Abstract. Electromagnetic scattering by a small particle or a collection of small 
particles can produce light with polarization characteristics different from those of 
the incident beam. If the incident beam is unpolarized, the scattered light generally 
has at least one nonzero Stokes parameter other than intensity, and this phenome- 
non is often called ’’polarization.” When the incident beam if fully linearly or circu- 
larly polarized, the scattered light may become partially polarized or even totally 
unpolarized, and this phenomenon is called ’’depolarization.” In this paper we use 
exact solutions of Maxwell’s equations and the vector radiative transfer equation to 
study the dependence of polarization and depolarization on such characteristics of 
scattering particles as size, refractive index, and shape. We also discuss the use of 
polarization and depolarization phenomena in remote sensing studies and particle 
characterization. 



1 Introduction 

The polarization state of a beam of light is traditionally described by a vector 
I = {I, Q, U, Vy composed of four Stokes parameters (T means transpose). 
The first Stokes parameter, I, is the intensity, while the other three para- 
meters describe the polarization state of the beam. The Stokes parameters 
are always defined with respect to a reference plane, e.g., with respect to the 
meridional plane of the beam in a spherical coordinate system (Fig. 1): 



I = E^El + EpE;, 


(1) 


Q = E^El - EpEy 


(2) 


U = -E^E; - EpEl 


(3) 


V = i{EpEl-EsE*y, 


(4) 



where and E^p are electric field components and the asterisk denotes a 
complex conjugate value [1]. The scattering of a beam of light by a single 
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particle or a collection of particles is fully characterized by a 4x4 matrix F 
(often called the Mueller matrix) which describes the transformation of the 
Stokes vector of the incident beam into the Stokes vector of the scattered 
beam. 

z 




Fig. 1. Spherical coordinate system used to define the Stokes parametres of a beam 
of light propagating in the direction specified by the unit vector n. 



jsca _ -pjinc 

Electromagnetic scattering most typically produces light with polariza- 
tion characteristics different from those of the incident beam. If the incident 
beam is unpolarized, the scattered light generally has at least one nonzero 
Stokes parameter other than intensity, and this phenomenon is often called 
’’polarization.” When the incident beam is fully linearly {I = Q,U = V = 0) 
or circularly {I = V,Q = U = 0) polarized, the scattered light may become 
partially polarized or even totally unpolarized, and this phenomenon is cal- 
led ’’depolarization.” Polarization and depolarization can be caused by both 
single and multiple scattering and depend on geometrical and physical cha- 
racteristics of the scattering particle(s) such as shape, morphology, refractive 
index, size parameter (ratio of the particle circumference to the wavelength 
of the incident light), and orientation with respect to the coordinate system. 

In this paper we analyze polarization and depolarization mechanisms for 
single scatterers and multi-particle configurations using exact numerical so- 
lutions of Maxwell’s equations and the vector radiative transfer equation. We 
also discuss how the strong sensitivity of polarization and depolarization to 
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particle parameters makes both phenomena a powerful tool in remote sensing 
studies and particle characterization. 



2 Single Scattering 

Consider single scattering by a small- volume element composed of a collection 
of sparsely distributed, independently scattering particles (see the following 
section for a discussion of what interparticle separation makes the particles 
independent scatterers) . For simplicity, we will assume that the incident beam 
is directed along the z axis and consider scattering in the xz plane. In this case 
the xz plane is also the scattering plane and the zenith angle of the scattered 
beam -d is also the scattering angle, 0. If the particles comprising the small- 
volume element are randomly oriented and have a plane of symmetry, the 
Mueller matrix has the well-known, simplified block-diagonal structure, [2]. 



F(0) 



Fii F\2 0 0 

F\2 F 22 0 0 

0 0 F33 F34 ’ 

0 0 — F34 F44 



( 6 ) 



and has only six independent elements. It is easily seen from (5) and (6) that 
the Stokes vector of the scattered light is given by 



>11 -h Fi2Q“^ 

rp Tine I 77' /^inc 
Tsca _ ^12J~ -r-t22W 

^ 331 /“'= -k F34M“° Cl 

-F34C/““ -k F44M“= 

and, in general, differs from Even if the incident beam is unpolarized, 
i.e., = (/, 0,0,0)^, the scattered beam has a nonzero Stokes parameter 

Q, unless the (1,2) element of the Mueller matrix is equal to zero. [The latter 
is always true in the exact forward {0 = 0) and exact backward (0 = tt) 
directions.] This phenomenon is usually called polarization. 

Figure 2 shows the phase function defined as 



7T 

p{0) = 2Fn{0)/ j Eii(0')sin0'd0' (8) 

0 

and the normalized Mueller matrix elements for a power law size distribu- 
tion of spheres and surface-equivalent, randomly oriented oblate spheroids [3] 
with an aspect ratio of 1.7. The refractive index is 1.53-k0.008i and the ef- 
fective size parameter of the size distribution, Xefft is 15. The computations 
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were performed using the conventional Lorenz-Mie theory for spheres [2] and 
the T-matrix method for spheroids [3] . There are several obvious differences 
between the Mueller matrix elements for spheres and spheroids. The phase 
function for spheroids exhibits an enhanced side-scattering and a suppressed 
backscattering. The (1,1) and (2,2) elements for spheres are the same, whereas 
for spheroids they are significantly different. The same is true for the (3,3) 
and (4,4) elements. The degree of linear polarization —F 12 /F 11 for spheroids 
is positive at side-scattering angles and is negative for spheres. We thus see 
that the elements of the Mueller matrix are highly sensitive to particle shape. 
Figures 3 and 4 as well as extensive computations reported by Hansen and 




Fig. 2. Phase function and normalized Mueller matrix elements for a power law 
size distribution of spheres and randomly oriented spheroids. The refractive index 
is 1.53-1-0.0081 and the effective size parameter of the size distribution is 15. 



Travis [4] , Mishchenko and Travis [5] , Mishchenko et al. [3] , and Mishchenko 
and Travis [6] show that they are equally sensitive to the particle refractive 
index and size parameter. This sensitivity makes accurate measurements of 
the Mueller matrix a valuable means of particle characterization. 

In the exact backscattering direction, the Mueller matrix becomes diago- 
nal and has only two independent elements [7]: 



F(7t) = (iiag[Fii(7r),F22(7r),-F22(7r),Fii(7r) - 2F22(7t)]. (9) 

Assume now that the incident beam is 100% linearly polarized parallel 
to the scattering plane so that = /“‘^(l, 1, 0, 0)^. The second Stokes 
parameter of the backscattered signal is not, in general, equal to the first 
Stokes parameter and this phenomenon is traditionally called linear depo- 
larization. The linear depolarization ratio, i.e., the ratio of the flux of the 
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Fig. 3. Phase function and normalized Mueller matrix elements for a narrow gamma 
size distribution of spheres. The refractive index is 1.45 and the effective size para- 
meter of the size distribution varies from 1 to 20. 




SCATTERING ANGLE (DEG) SCSATTERING ANGLE (DEG) SCATTERING ANGLE (DEG) 



Fig. 4. As in Fig. 3, but for a fixed effective size parameter of 10 and refractive 
index m varying from 1.3 to 1.6. 



cross-polarized component of the backscattered light relative to that of the 
co-polarized component, can be written as [see (9)] 



jsca _ Qsca ^ - F22 {tt) 

I sea Qsca ~ Fii{tt) + F22{tt) ' 



Similarly, we can consider a fully circularly polarized incident beam with 
Stokes parameters /“°(1,0,0, 1)^ to define the circular depolarization ratio 
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_ _ fii(7r) + F44W _ Fii(7t) - F 22 W 

C Jsca_ysca Fn (tt) — F44 (tt) -^ 22 ( 71 ") 

It is straightforward to show that there is a direct relationship between 
the linear and circular depolarization ratios [cf. (10) and (11)]: 



Sc = 26 l/{1 — Sl)- (12) 

Spherically symmetric particles produce zero depolarization since in this 
case Fii(7t) = F22(7t) [2]. For particles without spherical symmetry, the latter 
equality does not generally hold and such particles can depolarize backscat- 
tered light. This explains why the linear and circular depolarization ratios 
can be direct indicators of particle nonsphericity. It is readily verified [7] that 
Sl never exceeds 1 and 



0<2Sl<Sc< oo. (13) 

Figure 5 shows the results of T-matrix computations of the linear depola- 
rization ratio for a power law size distribution of randomly oriented nonsphe- 
rical particles with a refractive index of 1.311 [8]. For spheroids, e is the ratio 
of the largest to the smallest semi-axes. The shapes of prolate and oblate 
cylinders are specified by length-to-diameter and diameter-to-length ratios, 
respectively. The shape of second-order Chebyshev particles in a spherical 
coordinate system is described by the equation = Rq{ 1 + ecos2'&), 

where e is a deformation parameter specifying the maximal deviation of the 
particle shape from that of a sphere with radius Rq [9]. 

Figure 5 demonstrates that an interesting feature of the linear depola- 
rization ratio for essentially all shapes considered is a rapid increase with 
increasing effective size parameter from 0 to about 5. Furthermore, maximal 
Sl values for most shapes are observed at effective size parameters close to 
and sometimes smaller than 10. Since geometric optics concepts of rays, re- 
flections, and refractions are inapplicable to wavelength and sub-wavelength 
sized particles, our computations suggest that multiple internal reflections in 
very large particles, as discussed by Liou and Lahore [10], are not the only 
mechanism of producing depolarization and not necessarily the mechanism 
producing maximal Sl values. For example, the peak Sl value for polydisperse 
prolate spheroids with e = 1.2 is close to 0.7 and is reached at Xeff as small 
as 12.5. Also, geometric optics predicts size-parameter-independent depola- 
rization ratios for nonabsorbing particles, whereas exact T-matrix computa- 
tions for monodisperse scatterers show strong oscillations of with varying 
particle size [7]. It thus appears that resonance effects in small nonspheri- 
cal particles can be an efficient alternative mechanism of producing strong 
depolarization. 
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Fig. 5. Linear depolarization ratio versus effective size parameter for polydisperse, 
randomly oriented particles of different shapes. The refractive index is 1.311. 



The computations also show that although nonzero depolarization values 
directly indicate the presence of nonspherical particles, there is no simple 
relationship between Sl and the degree of particle asphericity (i.e., ratio of 
the largest to the smallest particle dimensions). Even spheroids with e as 
small as 1.05 (2.5% deviation from the perfect spherical shape) and Chebys- 
hev particles with |e| as small as 0.02 already produce strong depolarization. 
The largest Sl values are produced by prolate spheroids with aspect ratio 
as small as 1.2 and Chebyshev particles with e as small as 0.08-0.10 (8-10% 
deviation from a sphere). Furthermore, Sl for spheroids and, especially, cy- 
linders seems to saturate with increasing aspect ratio. Also of interest is that 
smooth scatterers (spheroids and Chebyshev particles) produce depolariza- 
tion ratios comparable to and even exceeding those for sharp-edged cylinders. 
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3 Multiple Scattering 

Cooperative effects in multi-particle configurations can significantly modify 
the process of polarization and depolarization. In particular, it appears that 
polarization is an intrinsically single-scattering phenomenon which can be 
significantly attenuated by multiple scattering. On the other hand, coopera- 
tive effects cause nonzero depolarization values even for clusters consisting 
of spherical particles and often reinforce depolarization caused by particle 
nonsphericity. 




Fig. 6. Phase function and normalized elements of the Mueller matrix for randomly 
oriented bispheres with touching and separated components. 



We will demonstrate this by considering the simplest configuration com- 
posed of two identical spheres with touching or separated components. Figure 
6 shows the results of T-matrix computations of the phase function and the 
normalized elements of the Mueller matrix for randomly oriented bispheres 
with an index of refraction of 1.5 -I- 0.005i [11]. The size parameter of the 
component spheres is 5 and the distance D between the sphere centers varies 
from twice their radius for touching spheres to eight times their radius for 
’’widely” separated spheres. For comparison, solid curves show the results of 
Lorenz-Mie computations for a single sphere with the same size parameter 5 
and represent the case of a bisphere with infinitely separated components. The 
oscillations in the single-sphere curves are a manifestation of the interference 
structure typical of monodisperse spheres and are traditionally attributed to 
interference of light diffracted and refracted/refiected by the particle [4]. It 
is obvious that the magnitude of these oscillations decreases as the compo- 
nent spheres become closer and is minimal for the bisphere with touching 
components. Since the cooperative effects strengthen as the interparticle di- 
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stance decreases, Fig. 6 unequivocally demonstrates that the most obvious 
effect of cooperative effects in multi-particle configurations is to suppress the 
single-sphere signature in the elements of the Mueller matrix. In particular, 
the degree of linear polarization —F 12 /F 11 becomes more neutral and less 
indicative of the physical characteristics of the component spheres. 




SIZE PARAMETER 

Fig. 7. Linear depolarization ratio versus component-sphere size parameter for ran- 
domly oriented bispheres with touching components. The refractive index is 1.5 -|- 
0.005i. 



The only exception is the ratio F 22 /F 11 , which is identically equal to 1 
for single spheres but increasingly deviates from 1 with decreasing distance 
between the bisphere components. As follows from (10), the deviation of this 
ratio from 1 results in increasingly nonzero values of the linear depolarization 
ratio. Therefore, we must conclude from (10) and Fig. 6 that although single 
spheres do not depolarize light, cooperative effects in multi-sphere aggregates 
can produce significant depolarization. Figure 7 shows as a function of size 
parameter for randomly oriented bispheres with touching components [12] 
and demonstrates rather large 5l values even for the simplest multi-particle 
configuration. 

It is interesting to note that as small a distance between the sphere centers 
as 4 times their radius makes the bisphere Mueller matrix elements very 
close to those of a single sphere (Fig. 6). This result provides an important 
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quantitative criterion of what minimal distance between wavelength-sized 
particles makes them independent scatterers [12]. 

Multiple scattering in optically thick discrete random media has similar 
effects on polarization and depolarization. For example, Hansen and Travis [4] 
use numerically exact adding/doubling computations of polarized radiative 
transfer in a plane-parallel atmosphere composed of spherical particles and 
show that the polarization signature becomes more neutral and featureless as 
the optical thickness of the cloud increases, i.e., when the amount of multiple 
scattering grows (Fig. 8). Similarly, they demonstrate that the polarization 
feature is most pronounced in the first-order-scattering contribution to the 
reflected light and diminishes as more orders of scattering are taken into 
account . 




Fig. 8. Degree of linear polarization of reflected light, —I/Q{%), integrated over 
the disk of a locally-plane-parallel planet uniformly covered with a cloud layer of 
varying optical thickness r and illuminated by an unpolarized beam of light. The 
cloud particles are modeled by a narrow gamma size distribution of spheres with an 
effective radius of 1 /rm and refractive index 1.44. The wavelength of the incident 
light is 0.55 /im. 



The computation of the depolarization ratios for optically thick discrete 
random media is a more complicated problem because the Stokes parame- 
ters of light reflected in the vicinity of the exact backscattering direction are 
affected by a phenomenon called coherent backscattering or weak photon lo- 
calization (e.g., POAN Research Group [13]). This phenomenon is caused by 
constructive interference of light travelling along the same multiple-scattering 
paths but in opposite directions and is not explicitly described by the radia- 
tive transfer equation. However, Mishchenko [14] used the general reciprocity 
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principle to prove that the solution of the vector radiative transfer equation 
[15,16] can still be used to exactly calculate the coherent backscattering con- 
tribution in the strict backscattering direction. Specifically, the total Mueller 
matrix in the exact backscattering direction has the structure given by (6) 
and can be represented in the form 



F = F^ + F^ + F^, (14) 

where is the first-order-scattering contribution, is the incoherent con- 
tribution of orders of scattering 2 and higher, and the matrix F*^ represents 
the coherent contribution and is expressed in the matrix F^ as follows: 



II 

O 


(15) 


Fn = \{Fh + Fi, - + Fl,), 


(16) 


F §2 = 2 ^^22 + ^33 ~ ^ 44)1 


(17) 


F§i = F22 + ^33 + -^ 44 ); 


(18) 


Fg = \{Fg - Fg + Fg + Ffj. 


(19) 


Since the matrices F^ and F^ can be found by solving numerically the 
vector radiative transfer equation, (14)-(19) render the total backscattering 
Mueller matrix. Note that the angular width of the coherent backscattering 



effect is negligibly small for dilute particle distributions such as clouds, but 
is significant for media composed of densely packed particles such as snow or 
sand layers. 

As an example. Fig. 9 shows the linear depolarization ratio versus cosine 
of the angle of incidence fiQ for a semi-infinite medium composed of spheri- 
cal particles with an effective size parameter of 10 and refractive indices 1.2 
and 1.6 [17]. It is seen that Sl is zero in the limit no — >■ 0, i.e., when only 
the light scattered once contributes to the total reflected signal. This first- 
order-scattering contribution is not depolarized because the particles forming 
the medium are assumed to be spherical. However, at normal incidence (/io 
= 1) the multiple-scattering contribution to the total backscattered signal 
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1^0 



Fig. 9. Linear depolarization ratio versus cosine of the angle of incidence for back- 
scattering by a semi-inhnite homogeneous layer composed of polydisperse spherical 
particles with an effective size parameter of 10 and refractive indices 1.2 and 1.6. 



is significant and results in strong depolarization. Figure 10 shows even lar- 
ger values for the circular depolarization ratio. The numerical computations 
suggest that 



< 1 and 5 l < 5c- (20) 

These properties of the depolarization ratios produced by multiple scat- 
tering are similar to those of the depolarization ratios caused by single scat- 
tering on nonspherical particles (cf. Sect. 2 ). This may make it difficult 
in many practical situations to distinguish between the two depolarization 
mechanisms. 



4 Particle Characterization and Remote Sensing 

Figure 11 exemplifies the use of Mueller matrix measurements for particle 
characterization and compares experimental data for a single latex sphere at 
a wavelength of 441.6 nm [2] and results of Lorenz-Mie computations. The 
refractive index of latex is well known (1.588), and it was also known from 
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1^0 



Fig. 10. As in Fig. 9, but for the circular depolarization ratio. 



the manufacturer that the diameter of the latex particle was in the 1 /im 
range. The best-fit agreement between the measurements and computations 
was obtained for a diameter of 1122 nm, while computations for diameters 
1108 and 1136 nm showed large discrepancies [11]. Figure 12 shows a simi- 
lar example but for a single two-sphere cluster in random orientation. In 
this case the best fit was obtained for a component-sphere diameter of 1129 
nm and the estimated accuracy of the retrieval was better than 20 nm [11]. 
These examples demonstrate the potential accuracy of particle sizing based 
on polarimetric measurements (see also Hirleman and Bohren [18]). 

The strong sensitivity of polarization and depolarization on the physical 
characteristics of scattering particles has been widely employed in remote 
sensing studies. A classical example is the analysis of ground-based pola- 
rimetric observations of Venus which enabled Hansen and Hovenier [19] to 
determine the size and chemical composition of Venus cloud particles. This 
work was continued by Kawabata et al. [20], Sato et al. [21] and Knibbe et al. 

[22] , who analyzed polarimetric observations provided by the Pioneer Venus 
orbiter. Mishchenko [6] used computations similar to those shown in Figs. 9 
and 10 to explain the unusually large depolarization ratios measured at cen- 
timeter and decimeter wavelengths for the icy Galilean satellites of Jupiter 

[23] . Mishchenko [24] and Rosenbush et al. [25] have interpreted telescopic 
polarization observations of Saturn’s rings and icy outer planet satellites in 
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Fig. 11. Normalized Mueller matrix elements for a single latex sphere. Solid curves 
show measurements by Bottiger et al. [2] and the dotted curves show results of 
Lorenz-Mie computations. 



terms of the polarization opposition effect produced by coherent backscatte- 
ring and estimated the size of ice grains covering the surfaces of these solar 
system objects. Mishchenko and Sassen [8] used the computations shown in 
Fig. 5 to explain the frequent occurrence of large 5 l values for very young 
aircraft condensation trails (contrails) and concluded that observed increases 
of with the contrail’s age can be explained either by a rapid increase of 
the particle size parameter from less than 1 to about 5 or by assuming that 
the contrail particles originate as perfect spheres and then acquire a certain 
degree of asphericity. Sassen [26] and Aydin [27] review the use of lidar and 
radar backscattering depolarization measurements for characterizing aerosol 
and cloud particles and precipitation. Liou et al. [28] discuss the potential of 
polarimetry in remote sensing of cirrus clouds, while Quinby-Hunt et al. [29], 
Lumme [30], and Hoekstra and Sloot [31] describe multiple applications of po- 
larimetry in remote sensing of the marine environment and in characterizing 
interplanetary dust grains and biological microorganisms. 
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Fig. 12. Normalized Mueller matrix elements for a single latex bisphere in random 
orientation. Solid curves show measurements by Bottiger et al. [2] and the dotted 
curves show results of T-matrix computations. 
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Abstract. Light which has been scattered by a suspension of particles contains 
information about particle motion, number, size and shape. One characteristic of 
such light which is particularly sensitive to shape is its polarisation state. If the 
particles are moving, then this will fluctuate in time and the statistical properties 
of the fluctuations can be used to deduce shape parameters. In this chapter a brief 
review of principles underlying this technique will be given together with the results 
of some preliminary experimental measurements 



1 Introduction 

Statistical analysis of light which has been scattered by systems of particles 
suspended in stationary or moving fluids forms the basis for many well esta- 
blished techniques designed to determine properties of either the particles 
themselves or the flows they seed. Laser anemometers and velocimeters are 
now commonly used to remotely measure wind velocity and other laminar 
and turbulent flows. Instruments for the measurement of particle size and 
dispersity have been commercially available for several years. In the case of 
very small particles the scattered light in these applications may be of such 
low intensity that the methods of photon counting and photon correlation 
spectroscopy are required [1]. 

One characteristic which appears to have received rather little attention 
is particle shape, despite the fact that it plays a significant role in many 
manufacturing processes. However, it is plain from standard texts on the 
subject that light scattering from particles is dependent on this quantity [2]. 
In the case of particles which are much smaller than the wavelength the 
dependence is manifest wholly in the polarisation state of the scattered light 
whilst for larger particles the situation is more complicated and information 
is additionally contained in other measures which vary with the orientation 
of the particles. 

In this chapter a tutorial overview will be given of the principles which 
might be exploited in order to develop a practical technique for particle shape 
determination using light scattering. Quantitative calculations will be restric- 
ted to very small particles so that Rayleigh scattering theory is applicable. In 
this theory each particle is represented by a simple dipole at its centre. The 
approach can be generalised to larger particles but alternative methods then 
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become available and interpretation of experimental measurements becomes 
more complicated. Even for the case of Rayleigh scatterers several different 
methods of analysing the scattered light are possible and optimisation will 
have to be made subject to the practical constraints of instrument design. 

In the next section a scalar random walk model will be used to intro- 
duce the notion of Gaussian and non-Gaussian scattering, corresponding to 
a large or small number of particles being present in the scattering volume 
respectively [3] . Statistical measures which may be determined by experiment 
will be defined. In Sect. 3 polarisation will be introduced into the model and 
quantities of interest calculated for the case of randomly tumbling spheroidal 
particles. Section 4 will review the results of the calculations and a preli- 
minary experiment will be described. Gonclusions will be presented in Sect. 
5. 

2 Gaussian and Non-gaussian Scattering 

When linearly polarised coherent light is scattered by a collection of particles 
which are dispersed over a volume which is much larger than the optical wa- 
velength, the complex amplitude of one polarised component of the scattered 
light in the far field can be represented as the result of a random walk on a 
plane i.e. as a sum of random phasors: 



N 

E = Qn exp (iifin) (1) 

n— 1 

Here, N is the total number of particles present in the scattering volume, 
{a} and {ip} are the amplitudes and phases of the fields contributing at the 
receiving point. For a non-interacting particle system the latter quantities 
are statistically independent and the phases are uniformly distributed. Equa- 
tion (1) is a two dimensional random walk and it is well known that as the 
number of steps becomes large the resultant vector will have statistically in- 
dependent rectangular components which are Gaussian distributed. In this 
situation a homogeneous moving pattern of bright and dark regions known 
as Gaussian speckle may be observed in the far field of the scattering system. 
The quantity observed with the naked eye (or detected by a photomultiplier 
tube for example) is the intensity of the light, I =\E\ . If it is assumed that 
the {a} are statistically identical then the second moment and second order 
coherence function can be calculated from (1) [3]: 





N{a^f 



(2) 
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(a?ai) 

N{al) {al) 



( 3 ) 



where 5 ^ 2 ^ = (-^ 1 ^ 2 ) / \/ i^i) {^ 2 ) and the subscripts refer to different spatial 
positions and/or times. 

It can be seen that when N becomes large the right hand side of the 
first equation reduces to the value 2. This is what would be expected for a 
complex Gaussian process for which the intensity / obeys a negative expo- 
nential distribution. In the same limit, the right hand side of (3) reduces to 
(1) ^ . 

1 -|- g ^2 and is known as the Seigert relation or Reid theorem, being a useful 
factorisation property of Gaussian noise. In the present context the spatial 
structure described by this formula (i.e. the speckle size) is just the diffrac- 
tion lobe width corresponding to the projected scattering volume whilst the 
temporal variations correspond to Doppler shifts determined by the relative 
scatterer motion. 

When the number of particles is small additional terms proportional to 
1 /N must be retained in both expressions. Since the number of particles might 
be expected to scale in proportion to the scattering volume, it is evident that 
deviation from Gaussian statistics will arise when this volume becomes suf- 
ficiently small. In a laser light scattering experiment this can be achieved 
by focusing down the incident beam. Equation (1) indicates that the addi- 
tional terms will depend on properties of the individual particles unlike the 
Gaussian limit where this information is lost. Equation (2) indicates that 
the corresponding intensity pattern will contain new length and time scales. 
These reflect the scattering properties of individual particles and their motion 
and are larger than those characterising the Gaussian term g. 

In a real scattering configuration particles will continually move into and 
out of the scattering volume. This means that the number of particles N will 
be continually changing. The simplest model for this phenomenon is that the 
number fluctuations are purely random being governed by a Poisson distri- 
bution. If, in calculating (2) and (3), averages are carried out over such a 
distribution before normalisation then the terms in N within the first ex- 
pressions on the right hand side vanish, whilst in the final terms N is simply 
replaced by its mean value. This implies that the intensity scattered by a 
small volume of suspension will in practice always fluctuate more than that 
scattered by a large volume. 

In some light scattering configurations particle clustering will be encoun- 
tered; this may be caused by aggregation or turbulent mixing of species, for 
example. It has been shown that this can lead to enhanced non-Gaussian 
fluctuations even when the mean number of particles is asymptotically large 
[4]. A particularly useful non-Gaussian limit distribution occurs if the num- 
ber fluctuations obey a negative binomial distribution. This distribution is 
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associated with the bunching of populations governed by a simple birth- 
death-immigration process. The limiting probability density of the random 
walk (1) is in this case a K-distribution and describes a Gaussian speckle 
pattern with a mean value which is varying in time or space according to a 
chi-square (Gamma) distribution [5]. 

As mentioned in the introduction, for the case of very small particles, the 
amount of scattered light may be so small that photon counting methods are 
required for detection [1]. In the case of classical light fields which concern 
us here, the measured signal can be interpreted as discrete photo-electron 
pulses which would be emitted at random by the detector if the incident 
light had a constant intensity. When the incident intensity varies, the pulse 
rate is modulated and the train of photo-electric events constitutes a doubly 
stochastic Poisson process: 



where 



p{n) 



I J dEP{E){aE)"‘ exp{—aE) 
0 



(4) 



T 

E{T) = i y dtm (5) 

0 

is the integrated intensity, T is the time interval containing the number n of 
photo-electron counts and a is the detector quantum efficiency. The factorial 
moments and correlation function of these counts are related in a simple way 
to the analogous statistical properties of the intensity: 

(n(n- l)(n- 2) •••(n-r-k 1)) _ (£!’') (nm 2 ) _ {hh) 

{nY - {EYYm){n2)~ {h){h) 

However, it is clear that the measured distribution of counts is related 
rather indirectly to the intensity fluctuation distribution. This has important 
implications for the choice of measuring technique. Note that if I is constant 
over the integration time then E = L 

3 Polarisation in the Random Walk Model 

The scattering amplitudes {a} appearing in (1) for the two dimensional ran- 
dom walk are associated with the measurement of a particular polarisation 
component which should be more generally identified by an additional label. 
Thus it is possible to think of correlating the outcome of random walks cor- 
responding to two polarisation states of the measuring apparatus. Assuming, 
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Fig. 1. Scattering geometry. The particle axis of rotation is along n. 



as before, that the particle scattering amplitudes are statistically identical it 
is not difficult to demonstrate that [6] 



where 



and 



(Ij) (4) 



1 + + 




\9jk\ — 



(K-r)(ki^) 




(7) 



(8) 



(9) 



Here the intensities are measured at the same space-time points and the 
subscripts now label the polarisation states of the incident and detected op- 
tical field. Defining subscript p for polarisation vector in the scattering plane 
and s for the case when it is normal to the scattering plane j, k = ss, pp, sp, ps 
for linearly polarised initial and final states. 

The single particle fluctuation factor / and Gaussian fluctuation factor 
g can be calculated exactly for very small particles (Rayleigh scatterers) by 
representing them as simple dipoles [2]: 
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a (X e ■ {aE') (10) 

where E is the incident electromagnetic field, e defines the measured state 
and a is the polarisability tensor. Only the case of spheroidal particles will 
be considered here, for which 



/«i 0 0 \ 

g= 0 «2 0 (11) 

y 0 0 02 / 

For an isolated spheroid with its axis of symmetry aligned along the di- 
rection (Fig. 1) n = (sin ip cos <p, sin ip sin (p, cos ip) it may then be shown that 
[ 6 ] 



a.ss = Eg [(oi — 02 ) sin^ ip cos^ <p + 02 ] 

Opp = Ep [(«! — 02 ) sin Ip cos (()(sin ip sin (p cos 6 + cos ip sin (p) + 02 cos 9] 
aps = Ep («i — 02 ) sin^ ip sin (p cos (p 

a.sp = Eg (oi — 02 ) sin i/) cos 0(sin sin 0 cos 0 -I- cos f/' sin 6*) 

( 12 ) 

where 9 is the scattering angle i.e. the angle between the direction of the 
incident plane wave and that of the detector. In many cases of interest the 
particles will be tumbling so that their orientation is random and governed 
by the distributions: 



p{(p) = (27t) 1 
p{ip) = \sinip 



Q < (p <2 tt 
Q < Ip < TT 



(13) 



These distributions may now be used to calculate the statistical properties 
of the scattering amplitudes (12). For example [6] 



(“ss) _ 5 35r^-|-40r^-|-48r^-|-64r-|-128 

7 (3r2+4r+8)2 

p{agg) = \2^JEg{a-l - a2){agg - U2Eg) 



■n-l 



(14) 



Here r is the polarisation ratio a\ja 2 which, for homogeneous small 
spheroids, is equal to the ratio of their major to minor axes. 



4 Discussion of Theoretical Predictions 

The behaviour of the fourth amplitude moments is illustrated in Fig. 2. Note 
that over some parameter ranges two values of the polarisation ratio give 
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In r 



( 1 ) 

( 2 ) 




Fig. 2. Fluctuation enhancement factors / versus polarisability ratio r for a single 
spheroid, (a) s co-polar 1 and cross polar 2, all scattering angles, (b) p co-polar. 




Fig. 3. Schematic diagram of the probability density of s co-polar returns from a 
single randomly tumbling spheroid. 
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Ass^<Ass> 



Fig. 4. Probability of s co-polar returns from an average number < > of ran- 

domly tumbling prolate spheroids. 



rise to the same degree of fluctuation. However, the probability densities are 
unambiguous. For example, Fig. 3 shows schematically the distributions of s 
co-polar amplitudes returned from single spheroids [7]. Clearly prolate and 
oblate shapes could be distinguished by a measurement of this quantity. In 
practice it would not be possible to maintain a single spheroid in the illumina- 
ted volume. As mentioned before, the number of scatterers would fluctuate. 
Figure 4 shows that provided the mean number is less than about 5 the distri- 
bution of polarisation fluctuations might still provide an unambiguous means 
of shape determination. Unfortunately, in such a non-Gaussian configuration 
the intensity of the scattered light would be very low and photon counting 
methods would be required. The Poisson transform 4 would then tend to 
obscure the detailed shape of the intensity distribution. 

Although all of the fluctuation factors, /, contain information about r, 
they can only be determined through measurements in the small N regime 
where the scattered intensity is low. Particle shaping methods based on non- 
Gaussian polarisation fluctuations are therefore possible but are likely to be 
susceptible to noise and imperfections in the measuring apparatus. On the 
other hand, the Gaussian speckle regime, where many scatterers are illumina- 
ted, is known to be robust and is used in existing particle sizing equipment, for 
example. Unfortunately, the fluctuation factors g are for the most part zero. 
However, there are a few exceptions such as the following cross-correlation 
coefficient between s and p co-polar returns [6] : 



{assapp)'^ _ cos^ 9 [(r - 1)^ -|- 10(r - 1) + 15] ^ 

(oL) (opp) (3^^ -I- 4r -I- 8) [(r - l)2(2cos2 0 -|- 1) -I- 5(2r -|- 1) cos^ 9] 



(15) 
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This formula could form the basis for a practical method for particle sha- 
ping but, in fact, is not directly measurable. However, equivalent information 
can be obtained by cross-correlating two Stokes parameters [8] and this has 
been tested using the experimental layout shown schematically in Fig. 5. This 
is described in detail elsewhere [9] but basically, laser light which is linearly 
polarised at 45° to the scattering plane is incident on the sample. The scatte- 
red s and p components are then detected by two photo-multiplier tubes and 
cross-correlated , normalised and plotted as a function of scattering angle. No 
variation with angle is predicted for spherical scatterers but strong variations 
occur in the case of spheroids. 




Figure 6 shows some preliminary results obtained using specially prepared 
Haematite particles having an r value near 3. Since these were larger than the 
optical wavelength, the Rayleigh theory described above is not valid and the 
theoretical curves shown for comparison were calculated numerically using 
Mishchenko code [10]. Encouraging agreement between the measured and 
predicted behaviour was found. 

The experimental method described above has a number of features which 
make it an attractive candidate technique for particle shaping. In particular, 

it is independent of 
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Fig. 6. Comparison of experimental results with theoretical predictions. 



* the strength of the incident radiation and any variation during the 
measuring time 

* the size and shape of the illuminated volume 

* the absolute transparency of the polarising elements 

* the relative strength of the intensity components after division 

* the detector efficiencies 

The performance characteristics of a number of other closely related con- 
figurations are currently being explored with a view to the development of a 
practical instrument. 



5 Conclusions 

It has been shown that polarisation fluctuations in light which has been scat- 
tered by non-spherical particles contains information about particle shape. 
When the particles are much smaller than the wavelength a simple theoreti- 
cal development can be carried out which predicts quantitative relationships 
between statistical moments, probability densities and the aspect ratio of 
scattering spheroids. These properties can be all be measured in principle, 
but it has been argued that in practice measurements in the Gaussian regime, 
when the scattering volume contains many particles, are likely to prove more 
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robust. Preliminary experimental measurements on characterised spheroids 
are in agreement with experimental predictions and provide encouragement 
for the development of a practical instrument for particle shaping. 
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Abstract. Researchers have studied the statistics of the fluctuations of the scat- 
tered intensity from a small number of non-interacting particles for many years. 
For such samples, the statistics deviate from typical Gaussian behaviour and the 
normalized moments of the fluctuations of the scattered intensity become a fun- 
ction of the mean number of scatterers in the illuminated area. For instance, the 
measurement of the second moment provides a simple method for determining the 
scatterer surface density. In this chapter, we present an extension of these studies by 
including the interaction between the scatterers. In particular, we focus on systems 
consisting of isotropic and non-isotropic particles located on flat substrates. We 
demonstrate that the probability of detecting zeros in the cross-polarized scattered 
intensity can be a useful characterizing tool. 



1 Introduction 



The statistics of the intensity of light scattered by surface and volume dif- 
fusers have been studied for many years. Laser speckle is a very well known 
phenomenon. A theoretical and experimental background of this phenomenon 
can be found in [1]. In most of the contributions contained in this book and in 
other works, it is assumed that the number of scattering centers contributing 
to the total scattered field is so large that, under the scalar approximation, 
the central limit theorem is applicable. In these cases, multiple scattering ef- 
fects are not included, and the total amplitude of the scattered electric field 
can be calculated by means of a random-walk procedure. The statistics follow 
a Rayleigh distribution typical of a random, complex, circular, Gaussian pro- 
cess. The corresponding scattered-intensity statistics can also be calculated 
in a straightforward manner and the result shows that the probability density 
function (PDF) is an exponential whose normalized moments are given by 
the simple factorial law 



in 

(/)- 



= n! 



( 1 ) 



For such Gaussian processes, the normalized second moment takes on 
the value of 2. This means that the signal-to-noise ratio, or the contrast of 
the signal, is unity. For such a low signal-to-noise ratio, it is very difficult 
to get any information about the diffuser from the statistical analysis of 
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the scattered intensity. However, when the size of the illuminated area (or 
volume) is small and only a few scatterers contribute to the total scattered 
field, the statistics no longer remain Gaussian. It then becomes possible to 
extract more information from the scatterer. An excellent review of non- 
Gaussian scattering processes with no scatterer interaction is contained in 
[ 2 ]. 

Non-Gaussian processes are especially interesting when the diffuser is 
composed of discrete scattering centers. This is because it is possible to obtain 
direct information of the density of scattering centers from an experimental 
analysis of the statistics of the scattered intensity, and in particular the nor- 
malized second moment, < /^ > / < / >^ [2]. Until recently, most of the 
studied systems do not take multiple interaction into account. 

The purpose of this chapter is to introduce the reader to the theoretical 
background of non-Gaussian processes when the diffusing targets contain 
discrete scattering centers and multiple interaction is included. The questions 
are how the scatterer interaction changes the statistics of the scattered light 
intensity and what information can be obtained from the analysis of the 
resulting statistics. In particular, we introduce a statistical method based on 
the measurement of the probability of detecting a null intensity in the cross- 
polarized scattered light. We focus our study on surfaces seeded with small 
particles because of their practical applications (surface contamination [3], 
the semiconductor industry [4], Surface Enhanced Raman Spectroscopy [5], 
etc.) and because this can be considered a simple model for more complex 
random natural surfaces [6]. Both isotropic and anisotropic scatterers are 
analyzed. 

This chapter is organized as follows. In Sect. 2, we present the scatte- 
ring model we use along with the mathematical tools and models that are 
used in the numerical calculations. Section 3 provides a theoretical basis to 
understand the intensity fluctuations of the resulting scattered light calcula- 
ted without considering multiscattering effects. Section 4 introduces a simple 
model that accounts for the interaction between particles when they are on a 
substrate. Finally, in Sect. 5 we present an empirical study of the statistics of 
the scattered intensity when scatterer interaction is important. In this case, 
scatterer anisotropy is also considered. 

2 The Scattering Model 

2.1 Scattering Geometry 

The scattering model is shown in Fig. 1. It consists of a flat surface that se- 
parates two semi-infinite media of dielectric constants eiand £ 2 - This surface 
is contaminated with particles, either isotropic or anisotropic, whose size is 
much smaller than the incident wavelength A. If the particles are isotropic in 
nature (spheres), they are characterized by a radius a and dielectric constant 
Sp in such a way that their polarizability a is given by [7] 
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a = 47ra^ 



gp-£i 
£p + 2e 1 



( 2 ) 




Anisotropic particles are characterized by a polarizability tensor ||a||, 
which is diagonal in the coordinate system given by the directions of the 
three main axes of the particle. The three diagonal elements of this tensor, 
ax,cty and a,, are given by [7] 



tti = 



Y 

£l + Li (Sp — £l)’ 



i 



x,y,z 



( 3 ) 



where V is the particle volume and Li{i = x,y,z) are geometrical factore 
satisfying the relation L* + Ly + L* = 1 [8]. 

The system is illuminated by a plane wave at an incidence angle 6i mea- 
sured from the surface normal. The plane of incidence coincides with the YZ 
plane and the scattered intensity is analyzed in this plane at scattering angle 
0g as shown in Fig. 1. We consider two linear polarization states of the inci- 
dent wave, either perpendicular (S’ polarization) or parallel (P polarization) 
to the plane of incidence. Similarly, we consider the same two linear polariza- 
tion (analyzer) states for the scattered wave in the plane of incidence. The 
co-polarized case when both incident and scattered electric fields are parallel 
is referred to as either SS or PP. Similarly, the cross-polarized case when 
they are perpendicular is referred to as either SP or PS. 
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2.2 The Perfect Conductor Approximation and the Image 
Theory 



One of the most common approximations used to solve electromagnetic scat- 
tering problems near rough surfaces is to assume that the surface behaves as a 
perfect conductor (s 2 = —oo). There are several reasons for using this appro- 
ximation. The most important is that the computation is greatly simplified. 
Obviously, the adoption of this approximation must have some physical mea- 
ning, and the results do have greater validity for metallic surfaces than for 
dielectric ones. However, depending on the problem, the optical properties 
of the surface are often irrelevant and we are justified in using the perfect- 
conductor approximation (PCA). For example, the most relevant physical 
aspects of the well-known enhanced-backscattering effect can be explained 
by assuming that the rough surface is perfectly conducting [9] . Similarly, the 
study of the statistics of the scattered intensity by particles on substrates 
does not depend significantly on the substrate refractive index. In this case, 
the use of the PCA has several advantages. For instance, with this approxi- 
mation and for the scattering geometry shown in Fig. 1, the image method 
(IM) introduced by Rayleigh [10] provides exact results for the scattered 
field. According to this method, the scattering problem of one small par- 
ticle located on a flat, perfectly conducting substrate illuminated by a plane 
wave is completely equivalent to having two particles (one the image of the 
other with respect to the substrate surface) illuminated by two plane waves 
(the originally incident wave and its image with respect to the surface plane) 
whose phase relationship is given by the boundary conditions at the perfectly 
conducting interface. Figure 2 gives a clear idea of how the image method 
works. The PCA and the IM constitute the basis of the calculations presented 
throughout this chapter. 




Fig. 2. Two equivalent scattering problems according to the Image Method (IM) 
introduced by Lord Rayleigh [10]. 
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3 Intensity Fluctuations. Non-interacting Particles 

Scattering systems formed by particles or discrete scatterers constitute a 
basic model for studying scattered intensity fluctuations. For the case of 
small, non-interacting particles, under the scalar approximation (only S or P 
polarized light is detected) the scattered electric (magnetic) held amplitude 
in the far-fleld region in a certain scattering direction is given by 

N 

A = exp{i(f,) (4) 

i=l 

where N is the number of particles contributing to the scattered held, Oi is 
the electric (magnetic) held amplitude of the scattered wave by particle i and 
ifi is its phase measured with respect to an arbitrary origin. From Fig. 1, if rt 
and denote the position of particle i (in polar coordinates) with respect 
to this origin, ipi can be written as 



(fii = kvi sin 9s sin 9^. (5) 

where k = 27t/A and the rest of the parameters can be found explicitly in Fig. 
1. We assume that the amplitudes and phases ipi of (4) are independent 
and that the cross-correlation terms {(piipj) and {ctidj) are equal to zero when 
i j. This is a consequence of the assumption that the particles are non- 
interacting. Furthermore, the random distribution of the positions of the 
particles, given by Vi and 9di, are assumed to give the corresponding phases 
ifi uniformly distributed over 27 t radians. If N is fixed in the illuminated area, 
fluctuations in the scattered intensity are due to the randomness of and 
ifi when we consider different samples by illuminating different parts of the 
scattering surface. It has been shown [II] that the probability of detecting a 
scattered intensity between I and I + dl {I on \A^) can be expressed as 

P{I)dI=-j^ d^dl (6) 

where Jq is the Bessel function of zero order. For brevity, we do not show the 
details of the derivation of (6), which can be found in [11]. This equation is 
obtained under the assumption that the individual scattering amplitudes, a^, 
of (4) are fluctuating (() in (6) means an ensemble average over the different 
distributions of a^), statistically independent and statistically identical. The 
calculation of the moments of the distribution function of (6) is straightfor- 
ward and gives the following results for the first two moments 



{I) = N{a^) 



(7) 
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N {a^f 



(8) 



Apart from the obvious result given by (7), (8) for the second moment 
indicates that when N is fixed and remains small enough, fluctuations in 
the scattered intensity can be enhanced due to fluctuations in the scattering 
amplitudes a^. Enhancement is only possible if the scattering centers are 
anisotropic in nature and randomly oriented (since we are considering small 
particles). This enhancement depends not only on the number of scattering 
centers (i.e., the size of the illuminated area) but also on the statistics of Ui. 
However, for spherical symmetry (o"^) = (a^) , so 



(/2)/(/)" = 2-1 (9) 

and instead of observing enhanced fluctuations, we get reduced fluctuations 
(i.e., (P) / (I)'^ < 2) and the speckle contrast is considerably reduced as 
fV — >■ 1. This means that as — >■ 1, it becomes harder to see a clear distin- 

ction between maxima and minima in the speckle field. In any case, when 
N tends to infinity, the probability density function P{I) tends to an expo- 
nential, and its second factorial moment tends to the value of 2, typical of a 
Gaussian statistical process. In this case all the information concerning both 
the number of scattering particles (both terms of (8)) and their scattering 
properties (second term of (8)) is completely lost. On the other hand, in the 
non-Gaussian regime, i.e. when N is small in the illuminated area, the mea- 
surement of {P} / (7)^ can provide information not only about the particle 
surface density but also about how these particles scatter the incident light 
beam, and therefore, about the particles themselves. 

An interesting problem arises when the number of particles in the illumi- 
nated area is not fixed but fluctuates. For instance, if the geometry shown 
in Fig. 1 is illuminated by a small spot of light (in principle, we assume an 
illuminating beam with a hard-edge profile which, for our calculation purpo- 
ses, can model a real Gaussian beam profile) and we get scattering intensity 
samples by scanning the illuminating spot over the surface, the number of 
illuminated particles will fluctuate from region to region. If the particle sur- 
face density p is uniform, the fluctuations of the particle number N can be 
modeled very well by a Poisson statistical process with mean 



N = 7rpi?^/ COS 9i, (10) 

R being the illuminating spot radius. The new expression for P{I) can be 
found by averaging (6) over a Poisson process with the mean given by (10) 
[11] (only isotropic particles are considered) 
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P(/)c;/ = 1 J (JoiC'/l) exp {N[Jo{Ca)-l]}dCdI (11) 

whose second factorial moment (the first moment is not interesting since it 
is simply proportional to N) is given by 



{l^)/{lf = 2+l/N (12) 

For isotropic scatterers, it is interesting to compare (9) and (12) and 
to observe that particle number fluctuations enhance those of the scattered 
intensity in such a way that in contrast to what happens for fixed N, the 
speckle contrast increases as N decreases. 

4 Particle Interaction. A Simple Model 

The main objective of this chapter is to show how the statistics of the scat- 
tered intensity from surfaces like that of Fig. 1 change when the interaction 
between the scatterers is included. In this section, we introduce a simple mo- 
del for analyzing the multiple scattering effects on the scattered intensity in 
the plane of incidence: changes in the co-polarized light, appearance of cross- 
polarized light, etc. A knowledge of the interaction between the scatterers, 
its range, its strength, etc., is the first step towards understanding the final 
and most important part of this chapter. 

The basis of the model consists of three elements: the perfect-conductor 
approximation (PCA), the image method (IM) (see Sect. 2.2) and the coupled- 
dipole method (CDM), also known in the literature as the discrete-dipole ap- 
proximation (DDA) (see the Introduction chapter). As previously outlined, 
the adoption of the PCA for the substrate allows us to use the IM as an exact 
method for calculating the scattered far field. As a result, the geometry of 
Fig. 1 can be replaced by one of two parallel planes of particles separated by 
twice the radius of the particles (for isotropic scatterers) and illuminated by 
two plane waves, one the image of the other taking into account the boundary 
conditions at the perfect conductor interface. The CDM is now introduced 
not only to consider the interaction between a particle and its neighbors, 
but also the interaction of a particle with itself due to the presence of the 
substrate. 

The CDM has been widely and successfully used to calculate the scattered 
field from particles with irregular shapes, for which an analytical approach 
to the electromagnetic problem is impossible. For the cases we are analyzing, 
this method is especially advantageous because, for the geometry of Fig. 1, the 
problem of calculating the local field in every particle (dipole) is simplified, 
i.e. 2N dipole units {N is the number of illuminated particles) distributed 
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randomly in two parallel planes, one the image of the other, separated by a 
distance 2a, a being the particle radius. The local electric field in particle i 
is given by 



2N 

where 1^ is the incident electric field on the ith particle, ||a|l^ the polariza- 
bility tensor of the particle j and is the unit vector from the jth to the 
ith particle. In (13), Aij and Bij are the interaction factors given by 




Aij — 



1 



ik 



k ~^ + — 



exp {ikrij) 



(14) 



Bij — 



3 , 2 3zfc 

-IT -k^ 



exp (ikrij) 



(15) 



In these equations, rij is the distance between the particle i and the 
particle j, and k =2ii j\. Equation (13) can be written in a compact form as 



2N 

t = ( 16 ) 

with ||C|ljj representing the interaction matrix between scatterers i and j. 
The problem now is reduced to solving the system of 6iV coupled linear 
equations given by (16) with 6N unknowns, Ei (i = 1, ...2N), which represent 
our final solution: the local field in every particle. One of the advantages of 
having the local fields is that it is possible to calculate the scattered field at 
every point in space. In particular, the total scattered far-field is given by 



e: = 



k^ exp(ikR) 

R 



2N 




i=l 



(17) 



where ||/|| is the unit matrix, R is the distance from the sample to the detector 
(kR ^ 1) and nt is a unit vector indicating the scattering direction. Equation 
(16) may be rewritten in matrix form as 



^ = (ii/ii-nr'^ 



(18) 
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SO the main calculation problem is the inversion of the complex matrix, |1/|| — 

ni- 

As an example of the application of the coupled-dipole method to analyze 
particle interaction, let us consider the scattering system of Fig. 1 with only 
two spherical particles separated by a distance d and located on a flat perfec- 
tly conducting substrate. This system is illuminated by a plane wave (S' or 
P polarized) whose propagation direction is given by the angle of incidence, 
9i. The scattered held is measured in the plane of incidence at a scattering 
angle 9s by means of an analyzer whose transmission direction is either per- 
pendicular or parallel to this plane, i.e., we observe either the S or the P 
component of the scattered held respectively. The interaction effect between 
the two scatterers is a function of their separation d, the angle of incidence 
9i, and the polarization of the incident radiation. In order to quantify this 
effect, we define the depolarization coefficients given by 



Ds {9„d) 




{Isp {9i,9s,d)) d9s 



{Iss {9t,9s,d)) d9s 



Dp{9„d) 




{Ips {9i,9s,d)) d9s 



{Ipp {9i,9s, d)) d9s 



(19) 



(20) 



where () stands for the average over all the possible relative orientations of 
the two particles. Because the scatterers have spherical symmetry and the 
scattered intensity is measured in the plane of incidence, the coefficients Ds 
and Dp are a measure of the amount of depolarized scattered light which 
appears in the plane of incidence due to particle interaction; i.e., they are a 
measure of the interaction. 

Fig. 3 shows Ds and Dp as a function of the separation d between the two 
particles for two incident angles 9i = 0° and 60°. For comparison we have 
also included results when the substrate is absent in order to analyze the 
effect of its presence on the depolarization coefficients. The common feature 
of all these plots is that both Ds and Dp decrease as d increases. This is an 
obvious conclusion because the interaction is weaker when the particles are 
farther apart. However, it is important to point out that, with the substrate, 
the values of Ds and Dp are smaller than those with no substrate under the 
particles. Furthermore, the decrease in the depolarization coefficients is faster 
when the particles are very close (d < O.IA) than when they are far apart 
(d > 0.4A). This means that the interaction effect only plays an important 




d/^ 



The depolarization coefficients Ds (crosses) and Dp (circles) (see (19) ai 
a function of the relative separation, d/X between the two scatterers, (e 
O.OOli) for two angles of incidence, (a) di — 0° and (b) — 60°. Continuo 

;h substrate: broken line, without substrate. 
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role when the particles are very close. This does not happen when there is 
no substrate. A detailed explanation of this can be found in [13]. 

Another interesting conclusion from Fig. 3 is that the presence of the 
substrate makes Dg and Dp behave differently. In particular the interaction 
is weaker for P incident polarization than for S incident polarization and 
decreases as the angle of incidence increases when the polarization of the 
incident light is parallel to the plane of incidence (P). When there is no sub- 
strate Dg and Dp show the same evolution regardless of both the separation 
between the particles and the angle of incidence. 

Most of these results have been checked experimentally by the authors of 
this chapter and the reader may find further details in [14] and [15]. 

5 Intensity Flnctnations. Interacting Particles 

In Sect. 3 the statistics of the light scattered by an ensemble of non-interacting 
particles has been briefly reviewed. In Sect. 4 the interaction between par- 
ticles on a substrate has been analyzed. Now we have the tools to consider 
the central question of this chapter: how do the statistics change when the 
particles are interacting? In this section, we consider both isotropic and ani- 
sotropic particles and present the statistics for both co- and cross-polarized 
scattered light. Because of the theoretical complexity of the proposed pro- 
blem, we resort to an empirical study through numerical simulations based 
on the IM and the CDM to extract the most interesting conclusions. 

5.1 Isotropic Scatterers 

a) Co-Polarized Scattered Light To study the effects of particle interac- 
tion, we have carried out some numerical simulations neglecting and including 
the effects of this interaction (see Fig. 4). The shape of the PDF of the co- 
polarized scattered intensity obtained in both cases is very similar, but some 
interesting differences appear. First, the PDF peaks are smoother. Second, a 
long tail appears for intensity values greater than iV^Ji (Ii is the scattered 
intensity from one particle), which reaches very low probability values. This 
means that, in a series of intensity measurements, some peaks with very high 
intensity values occasionally appear. The strength of particle interaction has 
been varied by modifying the optical properties of the particles. For example, 
the closer they are to their resonance condition (ep=-2 , as can be deduced 
from (2) for £i=l), the more light they scatter and so the influence of one 
particle on its neighbors increases. We observed that intensity peaks greater 
than N'^Ii, which show up as a consequence of particle interaction, increase 
in frequency and magnitude as the interaction between the particles increa- 
ses. When this interaction is strong enough, the appearance of more frequent 
peaks with high values of the scattered intensity modifies the statistics sig- 
nificantly. In this case it is not possible to identify any characteristic shape 
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of the PDF, and its moments and their relative errors increase. These errors 
could be reduced by means of a good characterization of the PDF in the range 
of intensities I > However, the number of samples needed would be 

very high, and this is unpractical both from the numerical and experimental 
point of view. 

0.1 



0.08 



FT- 0.06 

— p 
I — I 

Cu 

0.04 



0.02 



0 

0 1 2 3 4 5 6 

I/I 

1 

Fig. 4. PDF of the copolarized scattered intensity for two particles located on a 
flat substrate, neglecting (broken line) and including (continuous line) interaction 
effects 




b) Cross-Polarized Scattered Light Scattered light from spherical par- 
ticles in the plane of incidence whose polarization state is perpendicular to 
the incident one appears as a result of particle interaction. We show in Fig. 
5 the cross-polarized intensity PDF when the mean number of illuminated 
particles is = 5 (A^ is Poisson-distributed) for spherical particles with ra- 
dius a = 0.05A and dielectric constant £^=(-2.01,0). For all simulated cases, 
the statistics of the intensity fluctuations for the cross-polarization state have 
the same appearance. As can be seen, P(/cross) has a maximum at /cross = 0 
and decreases dramatically as the intensity increases. We also observe that 
/’(/cross) remains finite even for values of /cross much greater than the mean 
scattered intensity. This means that, when looking at the fluctuations, some 
peaks of very high intensity occasionally appear which are caused by the pre- 
sence of two or more particles very close to each other. The corresponding 
moments also have very high values, and their relative errors always exceed 
90%. Therefore, the information we may obtain by analyzing them is not 
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trustworthy. As an alternative, we propose measuring only one specific value 
of P(/cross)- In particular, this measurement was chosen for P(/cross = 0) 
because at that point the PDF (Fig. 5) shows the smallest relative error 
and also because this parameter can be used to quantify particle interaction: 
P{Icioss = 0) = 1 when there is no particle interaction, and decreases as 
interaction increases. 

We have developed a very simple model to study the dependence of 
P(/cross = 0) with the number of illuminated particles. We assume that the 
particle surface density is constant on the surface (iV is a Poisson-distributed 
random variable). We also assume that there is a threshold intensity Iq be- 
low which we cannot detect any scattered intensity, so that for /cross < /o, 
we consider /cross = 0 (this is indeed the case when carrying out any real 
experiment in the laboratory). If a particle scatters a cross-polarized inten- 
sity /q when it is separated by a distance L from its nearest neighbor, the 
probability of detecting zero cross-polarized intensity from that particle is 
the probability of it being separated from its nearest neighbor by a distance 
greater than L, 




Fig. 5. PDF of the cross-polarized scattered intensity for spherical particles with 
a = 0.05A and ep = (-2.01,0). N = 5. 



Pzilcross = 0) = Pi{d > L) = [ - exp{-al)dl = exp(-crL) (21) 

Jl o- 

where a can be identified as a particle linear density on the surface (ct = 



R being the radius of the illuminated area). The probability of 
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getting zero cross-polarized scattered intensity when the number of illumina- 
ted particles is N is then 



P{lcTOS. = 0 ) 



[Pi{Icross = 0)]^ = exp 




( 22 ) 



The parameter L, which is the maximum interparticle distance at which 
interaction effects can be detected, can be determined by performing a curve- 
fit carried out for surfaces contaminated with the same kind of particles. This 
parameter should depend not only on the particle characteristics (size, shape 
and optical properties) but also on both the illumination and detection para- 
meters (incidence and scattering angles and polarization of the incident and 
scattered radiation) . This model explains the behavior observed in our nume- 
rical results. As an example, we show in Fig. 6 the dependence of P(/cross = 0) 
with N, when the particles have a dielectric constant £p=(-2.01,0) and radii 
a = 0.05A (crosses), 0.07A (squares) and O.IA (circles). The marked points 
correspond to the simulated data and the curves show their fit to (22). The 
calculated values for L are 0.33A, 0.79A and 0.83A, respectively. This result 
is consistent with the model because it means that as particle size increases, 
the maximum interparticle distance at which multiscattering effects can be 
detected also increases. 




0 2 4 6 8 10 

N 

Fig. 6. P(/cross = 0) as a function of N for substrates contaminated by particles 
with different radii a and tp = (—2.01,0). 
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Interest in this model is more theoretical than practical. This is because 
in the laboratory it is very difficult to get a set of flat substrates contamina- 
ted with the same kind of particles, with different, homogeneous and known 
particle surface densities. Usually there is only one sample from which we can 
extract as much information as possible. It is then more practical to express 
(22) in the form 

P(Icross = 0) = exp [- (Lttct^) R^] = exp{-"fR^) (23) 




Fig. 7. P(7croas = 0) as a function of the radius of the illuminated area R, for two 
surfaces contaminated by spherical scatterers ( a = 0.05A and ep = (—2.01,0)) with 
different particle surface density, a. 



Therefore, in these cases we should measure the dependence of P(/cross = 
0) with the illuminated area. Figure 7 shows P(/cross = 0) as a function of the 
radius of the illuminated area R for two different surfaces with the same kind 
of particles (spherical, with a = 0.05A and £^=(-2.01,0)) whose linear particle 
densities are a = 0.84 and a = 0.65 particles/A. The marked points are the 
simulated results, and the curves show the fit to (23). Following this method, 
we can calculate L for a given value of cr (in our case, the relative error in 
this calculation is 1% with respect to the previously obtained value) and vice 
versa. This fit is also useful to determine the linear particle density ratio 
between two different surfaces contaminated by the same kind of particles. 
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According to (23), this ratio is 



O'! 




<J2 



\l2j 



(24) 



From the numerical results shown in Fig. 7, we obtain cti/(T 2 = 1.312. 
This allows us to calculate <J 2 if ui is known, with a relative error of 1.6%. 
Once we have a well characterized sample, it can be used to determine the 
linear particle density of other samples contaminated by the same kind of 
particles. 



5.2 Non-isotropic Scatterers 

In this section we examine the fluctuations of the scattered light from flat 
substrates contaminated by spheroidal particles. Their particle shape is cha- 
racterized by the ratio between their two main axes, r = dxjdi- Depending 
on this value, three different kinds of particles can be distinguished: spherical 
particles (r = 1), oblate particles (r < 1) and prolate particles (r > 1). In the 
calculations presented in this section, the particles have a dielectric constant 
£p=(-2.01,0) and di = 0.05A {di varies with r). We also assume that as the 
particles settle down on the substrate, prolate particles have their largest axis 
parallel to the substrate, while the shortest axis of the oblate particles aligns 
perpendicularly to the substrate. 



a) Co-Polarized Scattered Light Figure 8 shows the PDF of the co- 
polarized scattered light for oblate particles (on the left) and prolate particles 
(on the right), when the number of illuminated particles is assumed to be 
constant: N = 2 (two upper graphs) and N = 3 (two lower graphs). The 
scattered intensity has been normalized to its mean value. These results are 
quite different from those presented by Bates et al. (compare them with 
Fig. 1 of [16]), who studied the fluctuations of the scattered intensity from 
an ensemble of spheroidal particles, disseminated in a volume, and whose 
interaction was neglected. The difference is due to the effect of the presence 
of the substrate on the particle orientation; this gives them a preferential 
alignment. If we eliminate the substrate, allowing the particles to have an 
arbitrary orientation; we recover the results presented in [16]. 

We observe in Fig. 8 that oblate particles present the same PDF inde- 
pendently of the value of r, and this PDF coincides with the PDF obtained 
for spherical particles (r = 1). This is a direct consequence of the alignment 
imposed by the presence of the substrate. Because of this alignment, every 
particle scatters the same amount of light, so that their superposition only 
depends on the relative positions of the particles (as was the case with spheri- 
cal particles) . The only difference in the light scattered from oblate particles 
with different r is the value of the mean intensity, which cannot be seen in 
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Fig. 8. PDF of the copolarized scattered intensity for oblate (two left plots) and 
prolate (two right plots) particles. N — 2 for the two upper plots and N = 3 for 
the two lower ones. 



Fig. 8. When r decreases, di (and consequently ai) decreases; therefore, the 
amount of scattered light from each particle also decreases which makes the 
total scattered intensity smaller. The result is quite different for prolate par- 
ticles on the substrate. In this case the resulting PDF is different from that 
obtained for spherical particles, and its shape depends on the parameter r. 
This is because in this case the amplitude of the total scattered field de- 
pends not only on the particle relative positions, but also on the orientation 
of each particle on the substrate. As a consequence, the shape of the PDF is 
smoother. 



b) Cross-polarized scattered light Oblate particles have a polarizability 
tensor that is diagonal in the laboratory system because of their alignment on 
the substrate. This means that if there is only one particle on the substrate, 
it cannot scatter light with polarization perpendicular to that of the inci- 
dent beam, so this component can only appear as a consequence of particle 
interaction. The behavior of the fluctuations of the cross-polarized scattered 
intensity must then be analogous to that observed for spherical particles. We 
have checked that P(/cross) £^nd its normalized moments really do show the 
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Fig. 9. P(/cross = 0) for oblate particles of different shapes (different values of r) 
as a function of N. The marks show the simulated data and the lines, their fits to 
the model (see (22)). 




same behavior described for spherical particles and so all the previous con- 
clusions can be extended to substrates contaminated by oblate particles. We 
can also extend the model to study the dependence of P(/cross = 0) on the 
number of particles in the illuminated area. Figure 9 shows this evolution 
for oblate particles with different shape ratios r. The fitted values for the 
parameter L show that they increase as r increases. This is because when r 
increases, the height of the dipole (located at the center of the particle) and 
the polarizability component perpendicular to the substrate also increase and 
so particle interaction becomes stronger. 

The polarizability tensor of prolate particles has non-null off-diagonal ele- 
ments in the laboratory system; therefore, these particles are able to scatter 
cross-polarized light in the plane of incidence by themselves, even when there 
is no particle interaction. Figure 10 shows the PDF of the cross-polarized 
scattered intensity for prolate particles of different shapes when the number 
of illuminated particles is assumed to be constant. As predicted, its behavior 
is quite different from that observed for both oblate and spherical particles. 
The PDF shape does not vary with the ratio r, and so it is not possible to 
distinguish between surfaces contaminated by prolate particles with different 
shapes. In contrast to what happens with oblate particles, the normalized mo- 
ments of this distribution can be determined with a very small relative error. 
To distinguish between surfaces with prolate and oblate particles, measuring 
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Fig. 10. PDF of the cross-polarized scattered intensity from prolate particles for 
different values of r. Part (a) corresponds to N = 2 and (b) to N = Z. N assumed 
constant. 



these moments is probably more effective than analyzing the PDF shape of 
the co-polarized intensity. This is because the shape of these PDFs is very 
smooth, so the recognition of characteristic features is a difficult task. 

Figure 11 shows the dependence of the second (upper graph) and third 
(lower graph) normalized moments as a function of the mean number of il- 
luminated particles N . The dependence does not compare well with that 
predicted in (12) because this equation was deduced for isotropic particles 
(spherical or oblate particles when they are aligned on the substrate). When 
the particles are prolate, the amplitude of the scattered field from each par- 
ticle depends on the orientation of the particle with respect to the reference 
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Fig. 11. and ri'^^ of the fluctuations of the cross_polarized scattered intensity 
from prolate particles as a function of N for different values of r. The horizontal 
lines show the values predicted for Gaussian statistics. 

system. This modifies the statistics and consequently the corresponding mo- 
ments [17]. 
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Abstract. One of the two fundamental mechanisms underlying Surface- Enhanced 
Raman Scattering (SERS) is the existence of large electromagnetic (EM) fields 
in the vicinity of the rough metal substrates that are used as substrates. Sur- 
face roughness below the micron scale plays a relevant role in this process, due 
to the roughness-induced excitation of surface-plasmon polaritons. Since in many 
scattering configurations dipolar and/or electrostatic approximations cannot be 
employed, we study this EM mechanism from the rigorous standpoint of classi- 
cal Maxwell equations. By means of numerical simulation calculations based on 
the Green’s theorem integral equation formulation, the linearly polarized electro- 
magnetic field scattered from one-dimensional, randomly rough metal surfaces is 
obtained. In particular, Ag, An, and Cu surfaces are considered possessing fractal 
properties analogous to those observed in colloidal aggregates or coldly deposited 
films commonly used in SERS experiments. We analyze the influence of the roug- 
hness parameters on the near field intensity. The enhancement factor of the SERS 
signal is assumed to be proportional to the square of that of the near field intensity 
at the pump frequency; in light of the random nature of the roughness, the ana- 
lysis is performed on the probability density function of the enhancement factor. 
The optimum pump frequency is obtained from the spectral dependence of both the 
average field enhancement and the absorption. In addition to the near field intensity 
calculations, the far field scattered from such random self-afRne fractals is studied, 
revealing interesting features in the angular distribution, such as incoherent peaks 
at the specular direction, and weak backscattering peaks for the rougher surfaces 
that also yield large near field enhancements. 



1 Introduction 

Raman spectroscopy is a very useful technique to study the structure of all 
kind of materials. However, the weakness of the Raman signal is a drawback 
that forces the experimentalist to use high-power light and extremely sensitive 
detectors. During the 60’s, the discovery of lasing gave an enormous impulse 
to the technique owing to the dramatic increase in the radiation intensity on 
the sample. Later on, in the mid 70’s, Raman spectroscopy underwent a se- 
cond revolution with the observation of Surface-Enhanced Raman Scattering 
(SERS) coming from a molecule adsorbed onto a rough surface (see Fig. 1). 
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Metal: Ag, Au, Cu 



Fig. 1. Illustration of surface-enhanced Raman scattering. 

SERS was first observed by Fleischmann et al. [1] for pyridine adsorbed at 
silver electrodes. Then Jeanmaire and R. P. Van Duyne [2] and Albrecht and 
Creighton [3] independently suggested that this high increase in the inelastic 
Raman signal is produced by the intensification of the electromagnetic (EM) 
field near the metal surface of the electrode where the pyridine is adsorbed. 
Typically, SERS enhancements factors with respect to conventional Raman 
of six orders of magnitude were accounted for. Thus far the origin of the 
SERS phenomenon is not entirely clear, but it is generally accepted that an 
EM enhancement mechanism is to a large extent responsible for it [4]- [8]. 
(In addition to the EM mechanism, other effects, such as resonance Raman 
scattering [9] or chemisorption [10], can have a multiplicative impact upon 
the SERS signal.) 

The amount of theoretical work dedicated to the EM mechanism in SERS 
is enormous. It is out of the scope of this chapter to review all the existing 
theories and models; for that purpose, there are various excellent review ar- 
ticles [4]- [8], [11] in the literature, which refer to SERS theories in certain 
depth. It is interesting to note, however, that most of the numerous theo- 
retical works devoted to the description of the EM mechanism rely on the 
electrostatic and/or dipolar approximations, and use as surface models extre- 
mely simple geometries. The common qualitative picture to all those models 
is the fact that the EM field enhancement (FE) stems from the roughness- 
induced excitation of surface-plasmon polaritons [12] (SPP) by the incoming 
light, resulting in large EM fields in the vicinity of the metal surface that 
strongly polarize the adsorbed molecules. A similar EM enhancement is as- 
sumed to occur at the Raman-shifted frequencies, which combined with the 
former direct enhancement may give rise to an overall FE factor on the or- 
der of or above 10"^. A SPP can be either propagating along a continuous 
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surface (extended SPP), confined within metal particles (particle plasmon 
resonances), or even localized due to Anderson localization (localized SPP); 
of course, some of these effects may take place altogether on real SERS metal 
substrates. 

Moreover, from all of these studies and abundant experimental evidence, 
it has been commonly accepted that a rough surface on which the adsorbate 
is located (within a few amstrongs) is crucial to observe SERS. Several kinds 
of rough metal surfaces have been employed as SERS substrates: electrodes, 
films, and colloids. All of them exhibit microstructures, being even necessary 
the presence of corrugation of scale of the order of or below a tenth of a 
micron. Cold deposited silver islands films and silver or gold colloids are 
the most widely used. In the case of colloids, their aggregation is necessary 
to obtain SERS effect [13]- [16]. Interestingly enough, the internal structure 
of these films and aggregates can be in most cases characterized in terms 
of fractal properties: self-affinity in the case of films [17] and self-similarity 
in the case of aggregates [14,15]. In general, many aggregation and surface 
growth processes create such structures [18,19]. 

Therefore, inasmuch as the quantitative evaluation of the surface EM 
field is central to the SERS effect, a rigorous knowledge of the EM scattering 
process for surface models as realistic as possible is evidently needed. In recent 
years, the theoretical efforts have been directed towards either describing 
approximately realistic surface models [14], [20]-[22], or using the full EM 
theory to study simplistic surface models (though introducing increasingly 
complex properties) [23]; in addition to that, some efforts have been made to 
combine both the rigorous EM theory with complex, realistic surface models 
[24]-[26]. 

In what follows, we will review the latter works, which address the issue 
of the direct EM enhancement on continuous, fractal metal surfaces from the 
rigorous standpoint of classical electrodynamics. To that end, we present in 
Sect. 2 the exact integral equations formulation of the EM wave scattering, 
not restricted to the limitations of electrostatic and/or dipolar approxima- 
tions. This formulation has been successfully employed through numerical 
simulation calculations to study the far field scattered from one-dimensional, 
randomly rough, either metal or dielectric, surfaces [27]-[29]. In Sect. 4, we 
will apply it to calculate the linearly-polarized EM field in the vicinity of 
self-affine fractal surfaces (described in Sect. 3) of Ag, Au, and Cu, simi- 
lar to those mentioned above that are used as SERS substrates (cf. [24,26]). 
Section 5 analyzes in detail the occurrence of EE and the dependence on 
both roughness parameters (fractal dimension and rms height deviation) and 
excitation wavelength (cf. [24,26]). The angular distribution of the far field 
intensity scattered from such metal surfaces (cf. [25]) is studied in Sect. 6. 
Finally, the main advances of all these works are summarized in Sect. 7, 
and some guidelines on ongoing work and foreseen perspectives included in 
Sect. 8. 
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2 EM Scattering Theory 



Let us model the geometry associated with typical SERS experimental con- 
figurations exploiting metal surfaces by means of a one-dimensionally rough, 
vacuum-metal interface 2 = C(2^)- The metal occupying the lower half-space 
[z < C(a;)] is characterized by an isotropic, homogeneous, frequency-dependent 
dielectric function e''(oj). This surface is illuminated by a monochromatic, li- 
nearly polarized Gaussian beam of frequency w = 27t/A and half- width W 
incident at an angle 0q measured counterclockwise with respect to the positive 
z axis. 

The restrictions to one-dimensional surfaces and linearly polarized inci- 
dent beams have the advantage that notably simplify the formulation, while 
still rigorously considering the scattering processes relevant to interpret the 
SERS EM mechanism. In fact, under such conditions, we can reduce the 
initial three-dimensional vectorial problem to a two-dimensional scalar one, 
where the EM field is entirely described by the j/-component [denoted by 
Uis{r,Lo), with r = {x,z)\ of, respectively, the electric field for /? = s pola- 
rization (transversal electric) and the magnetic field for /3 = p polarization 
(transversal magnetic). It should be emphasized that this simplification, un- 
like the case of straight scalar formulations, stems only from the scattering 
geometry, so that the full vector, EM character of the problem is preserved. 

Our rigorous calculation method is based upon the exact integral equa- 
tions formulation of the above mentioned scattering problem [26]- [29]. This 
formulation can be briefly summarized as follows. On the basis of the Helm- 
holtz equations satisfied by the field in the upper U'^{v,lo) (z > () and 
lower U^{r,oj) (z < () semi-infinite half-spaces, and by applying the Green’s 
theorem and recalling the radiation conditions at infinity, we are led to the 
following four integral equations: 
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II 


w), z < C(a;); 


(4) 



where r' = {x' , z' = C(3^0) the normal derivative is djdn = (h • V) [with 
n = 7 “^(— ^'(cc), 0, 1) and 7 = (1 -|- (C'(a;))^)^/^]. The Green’s function G is 
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given by the zeroth-order Hankel function of the first kind. In our work, the 
incident field, as mentioned above, is given by the Gaussian beam [27]: 



z),u>) = exp t — (xsin^o — zcosdo)(l + w{x, z)) 



CO 
I — 
L C 



X exp —W ^(xcos^o + -zsin^o)^ , 



j{x,z) = 






[2W ^(xcos0Q + -zsin^o)^ — l] • 



( 5 ) 

(6) 



The integral equations (4) enable us to obtain an exact solution for the 
scattering of a s- or p-polarized electromagnetic wave. A set of two coupled 
integral equations can be obtained by using (1) and (3) as extended boundary 
conditions with the aid of the continuity conditions across the interface: 



Up{Tc,uj) L=c<+’G) = L=c<-)g)- C^) 







1 
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dn 
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dn 





with = 1 and and = lim£_>o(C(a:) ± e). As shown 

in detail in [28], upon numerically solving the resulting system of integral 
equations for the surface field and its normal derivative, which play the role 
of source functions, the scattered field in the entire space can be calculated 
from (1) and (2). Therefore, for our purpose, the exact, self-consistent surface 
field results directly from one of the source functions, and then (1) will be 
used to calculate such total field at any other position r in vacuum z > 
Note that this total field consists of the incident field plus the scattered 
field, the latter being in turn given by the surface integral in the left-hand 
side of (1). We will deal in the numerical calculations with the former surface 
integral as shown in (1), and also with its far field expression, which yields the 
angular distribution of scattered light (Og being the scattering angle measured 
clockwise with respect to the positive 2 : axis): 
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(9) 



With regard to the near EM field, apart from the resulting total field 
intensity 



I/3{r,uj) =|C/^(r,a;)|^ 
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we define the EM field enhancement factor as the normalized intensity: 



CT/3(w) 






(10) 



(Strictly speaking, ap makes full sense for r in the region illuminated by the 
central spot of the incident beam.) Relevant to the SERS signal is the product 
of EE at the excitation and Raman frequencies (respectively, w and lo') which 
can be approximated by the square of the EE at the pump frequency [23,26]. 

The numerical implementation of the above given formulation relies on 
the conversion of the integral equations into linear equations: By means of 
a quadrature scheme the surface is truncated to a length L consisting of N 
sampling points [28]. In practice, for every realization of the random surface, 
it reduces to solving a system of 2N complex linear equations for the source 
functions, followed by a vector-matrix multiplication to obtain the scattered 
field at each position in the vacuum half-space. Furthermore, when statistical 
quantities such as ensemble averages ((...)) or probability density functions 
(PDF) are needed, the procedure is repeated for a sufficiently large number 
Nr of surface profile realizations (ergodicity is assumed). Finally, various 
numerical tests are performed as regards energy conservation, reciprocity, 
and convergence with increasing N and Nr (cf. [28]). 

It should be mentioned that the assumption that the upper propagating 
medium (z > C(a:)) is vacuum with = 1 can be easily modified to account 
for any other isotropic and homogeneous dielectric medium by conveniently 
renormalizing the dielectric functions and the incoming beam wavelength [30] . 
Also, despite the fact that the region occupied by the metal is semi-infinite, 
our theoretical model can be in practice applied to metal films on a semi- 
infinite substrate provided that their thickness be sufficiently large compared 
to the metal skin depth d = c(e^(w))“^/^/u;. However, in the case of Ag 
island films for which plasmon resonances on isolated particles are crucial 
[5,6], the surface geometry should be correspondingly modified to account 
for such close contours (see e.g. Madrazo et al. in this book). 



3 Fractal Surface Model 

The theoretical formulation described in the preceding section can be applied 
to any kind of either random or deterministic surfaces provided that the pro- 
file function is known. We restrict to random fractal surfaces that reproduce 
the properties of many naturally occurring surfaces [17,19,31]. These are self- 
affine fractals, which are modeled henceforth by the time dependence of the 
trace [i.e. the x dependence of the height z = C(3^)] of ^ one-dimensional 
fractional Brownian motion (fBm) . Self-affinity implies that the scaling rela- 
tions posses different ratios depending on the direction (along x and z). It is 
characterized by the local fractal dimension D = 2 — H, H being the Hurst 
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Fig. 2. Realizations of the self-affine ran- 
dom fractal profiles numerically generated 
by means of the Voss’ algorithm: <5/ A = 0.05. 
The corresponding fractal dimensions are: 
(a) D = 1; (b) D = 1.2; (c) D = 1.5; (d) 
D = 1.8 (e) D = 1.9; and (f) D = 1.99. 



exponent; the global fractal dimension is 1 (recall that for self-affine frac- 
tals the fractal dimension is not uniquely defined). In order to generate such 
fractal surfaces to an arbitrary resolution, we employ the successive random 
addition algorithm proposed by Voss [31]. 

By assigning a length L = 52.7/xm to each realization, the sequences of 
random fractal numbers numerically generated as mentioned above are map- 
ped onto real surfaces of the kind employed in SERS experiments [16,17]. The 
resolution {N f = 1024) is such that the minimum size of the surface asperi- 
ties Ax = 51.45 nm closely resembles the typical sizes of the metal particles 
that form some widely used SERS substrates [13,16]. Moreover, to reproduce 
such slowly varying profile below the lower spatial limit associated with the 
particle shape, a cubic spline interpolation with rii = 1 points is performed. 
In this regard, it should be remarked that the rms heights [(5 = (C^(3;))^^^] of 
the fractal surfaces studied throughout this work are also comparable with 
such typical sizes (see Fig. 2). 



4 Near EM Field 

We now present the numerical results obtained as described in Sect. 3 for the 
self-affine random fractal surfaces defined in Sect. 2. In all the cases shown 
below, unless otherwise stated, a length L =20.58 ^m consisting of fV = 
800 = 400(rii-|-l) (extracted from N f = 1024) is taken per realization, and the 
intercept of the Gaussian incident beam with the plane of the mean surface 
is kept constant regardless of the angle of incidence {W / cos^o = ^/4). This 
intercept illuminates a sufficiently large region of the fractal surface in terms 
of the range of incident wavelengths considered below (0.4 /xm< A <1.1 /xm). 
It should be emphasized that the choice of scattering parameters, leaving 
aside its being very realistic from the experimental standpoint, guarantees 
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(a) D=1.2 




-5 -2.5 0 2.5 5 

X (microns) 



(b) D=1.99 




-5 -2.5 0 2.5 5 

X (microns) 



Fig. 3. Contour density plot of the near field intensity resulting from the p-polarized 
scattering from a Ag fractal surface with = 0°, 5 = 25.725 nm, A = 514.5 nm 
(e'' = —9.4 + iO.8), L =10.29 /rm, W = L / AcosOq, and N = 400. (a) D = 1.2, and 
the maximnm intensity is /max = 0.05; (b) D = 1.99 and /max = 0.11. 
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additionally that the random surface exhibits self-affine scaling for a large 
range of lengths (approximately from below one tenth of the wavelength to 
beyond the illuminated area), thus covering the relevant scaling interval of 
this scattering problem. 

In Fig. 3, the intensity of the p-polarized EM near field for normal in- 
cidence is shown in a region of about one wavelength above two Ag fractal 
surfaces of length L =10.29 /xm 5 = 25.7 nm, corresponding, respectively, 
to D = 1.2 (Fig. 3(a)) and D = 1.99 (Fig. 3(b)). At the wavelength of 
the incident beam (A = 514.5 nm), the dielectric function of silver is [32] 
e^g(w) = —9.4 -I- to. 8 . Both in Figs. 3(a) and (b), it is evident that the in- 
cident Gaussian beam impinging on the Ag surface (lower part of the plot) 
is scattered due to the surface roughness: The interference between the in- 
cident and scattered fields gives rise to the observed intensity pattern with 
horizontal fringes of length limited by the beam width. In Fig. 3(a) the EM 
field appears to be weakly scattered, and a strong specularly reflected beam 
results in a well defined interference pattern, with the first maximum above 
the surface close to z = c( 27 t — (j)fi) / (2a;) = 231 nm {(pR being the phase shift 
introduced by the Fresnel coefficient on reflection). The more rugged surface 
profile with D = 1.99 used in obtaining the results shown in Fig. 3(b) scatters 
the EM field more strongly, thus leading to both a weaker and more distorted 
reflected beam (the interference pattern appears blurred), and to larger EM 
field intensities in the proximity of the surface 2 < A/4 = 128.625 nm. 

To gain physical insight into the origin of such FE, let us calculate the 
EM field resulting from subtracting the incident and the singly scattered (in 
the Geometrical Optics sense) fields from the total field. The latter field is 
numerically obtained from the surface integral in the Ihs of ( 1 ), upon repla- 
cing the exact source functions by their Kirchhoff approximation expressions 
[24,26]: 



U^{[x,C{x)],uj) = [1 T Ri3{0{x))] U^"\[x,C{x)],uj), (11) 

Rfj being the Fresnel coefficient for the local angle of incidence 6{x) = 
6*0 — dC,/dx, and the minus (respectively, plus) sign corresponding to P = s 
(respectively, P = p)', a similar expression holds for the normal derivative 
[28]. In Fig. 4, the intensity of the component of the doubly (and highly) 
scattered EM fields is shown for the same fractal surfaces as used in Fig. 3, 
but for a smaller region in the vacuum up to 2 « A/4 above the surface. We 
observe in both Figs. 4(a) and 4(b) an interference pattern whose spatial 
frequency along the surface indicates the existence of SPP with wave vector 
k = {w / c){e^ {co) / {I + e''(a;)))^/^ propagating in opposite directions. Moreo- 
ver, the excitation of SPP has been verified by calculating the intensity from 
Fig. 4(b) along the line x = 0 away from the interface, confirming that it clo- 
sely follows the well known SPP exponential decay in the form exp[— 2/3(a;)z] 
with decay length £ = (2P{uj))~^ = 5i[c(— 1 — e^^(a;))^/^/(2a;)] « 119 nm (Ji 
denoting the real part). Whereas for the surface with lower fractal dimension 
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(a) D=1.2 




- 2-1012 

X (microns) 



(b) D=1 .99 




- 2-1012 
X (microns) 



Fig. 4. Same as Fig. 3 but only for the doubly (and highly) scattered field intensity 
and a smaller near-field region, (a) D = 1.2 and /max = 0.00056; (b) D = 1.99 and 




12 Surface-Enhanced Raman Spectroscopy 225 



3 

Q. 



..MTfTjf 


1 (a)- 

1 1 


..MTTTjf 


' 1 

(b). 

1 , ' 




(c). 



3 4 



0.3 

0.2 

0.1 

0.0 
0.3 

1 0.2 

0.1 

0.0 
0.3 

0.2 

0.1 

0.0 



(d)-^ 



1 









(e)-^ 



(f) 



lil'llThTTTT- 



10 15 



Fig. 5. PDF of the p-polarized 
FE factor p{ap{u})) for Ag 
fractal surfaces with rms 
height 5 = 25.725 nm and 
several fractal dimensions: 
D =1 (a), 1.2 (b),1.5 (c),1.8 
(d), 1.9(e), and 1.99(f). Other 
parameters are: A =514.5 
nm (e^ = —9.4 + iO.8), 

W/X = 10cos6lo, L = 20.58 
nm, and N = 800. Average 
over Ndata = 200000 points 
from Nr = 20 realizations and 
9o =0°,5°,... ,60°. 



D = 1.2 the intensity of this field component is very weak (two orders of ma- 
gnitude smaller than the incident field), there exist very strong intensities in 
Fig. 4(b), comparable to those of the incident and singly scattered EM fields. 
Further evidence that the roughness-induced excitation of SPP is responsible 
for the occurrence of large FE is given by the polarization selectivity (the 
results for s polarization, not shown here, yield no relevant FE). 



5 Surface EM Field Enhancement 

Since the higher FE values are found precisely on top of the metal surface, 
in what follows we concentrate on the roughness and frequency dependence 
of the surface EM field. For the random surfaces we are dealing with, this 
surface EM field is a stochastic variable [24,26], so that its PDF will be 
studied below. To that end, the FE factor [cf. (10)] is calculated within the 
central region of the incident beam spot —W/cosOq < x < W/cos9q on 
every surface profile from a set of Nr = 20 realizations, and for several angles 
of incidence = 0°,5°,... ,60°. The PDF p(cr/ 3 (w)) is then obtained from 
200000 data points. 

5.1 Surface Roughness: Flactality and RMS Height 

Figure 5 shows the p{ap(uj)) for Ag fractal surfaces with 6 = 25.725 nm 
and different local fractal dimensions D = 1, 1.2, 1.5, 1.8, 1.9, 1.99, at the wa- 
velength A = 514.5 nm (e^g(w) = -9.4-1- i 0.8). It is evident in Fig. 5 the 
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<^P 



Fig. 6. Same as Fig. 5(e), but for S 
(nm)=25.725, 51.45, 102.9, 205.8, and 308.7. 



strong dependence of the FE on the fractality of the rough surface. When the 
fractal dimension is small, the excitation of SPP is weak [see also Fig. 3(a) for 
D=1.2], and the surface EM field is basically given by the Kirchhoff approxi- 
mation [cf. (11)]. This is clearly seen in Figs. 5(a) and (b) for = 1 and 1.2, 
respectively: p{<Jp) takes the form of a convolution of narrow distributions for 
each 9q centered about the flat surface FE factor «| 1 -I- Rp{Oo) P (with 
the maximum close to the normal incidence value dp^^ ~ 3.53). Upon increa- 
sing the fractal dimension, the roughness-induced excitation of SPP increases 
so that the surface FE can reach values larger than (and also smaller, as 
expected from energy conservation). The distribution of p{crp) thus widens 
as seen in Figs. 5(c) and (d) for D = 1.5 and 1.8. For high fractal dimensions 
I? = 1.9 and 1.99, the larger surface FE values are encountered, and the broa- 
dening of the PDF becomes even more pronounced, manifesting a slow decay 
for values of dp beyond the maximum probability [see Fig. 5(e) and (f)[. Our 
results for Au and Cu, not shown here, manifest the same dependence of the 
PDF on the fractal dimension. 

Therefore, despite the fact that the rms deviation of heights of the en- 
sembles of random fractal surfaces used in Fig. 5 are all the same, the PDF 
differs significantly from one fractal surface to another, in such a way that 
the higher the local fractal dimension is, the larger are the surface FE found. 
The physical reason for this behavior has been pointed in [24]. The surface 
power spectrum, which plays the role of the scattering potential, exhibits a 
power law decay -with increasing spatial frequency- that is slower for higher 
D. Hence, upon increasing the fractality the roughness-induced photon-SPP 
coupling is favored, which lies on the basis of the existence of large FE for 
the configuration being considered. 

In Fig. 6 the influence of the rms roughness is shown by plotting p{dp) for 
5(nm) = 25.725, 51.45, 102.9, 205.8, and 308.7, in the case of D = 1.9 [the 
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other parameters as in Fig. 5(e)]. Note that the increase of the rms height 
further broadens the PDF: In fact, the maximum probability shifts to lower 
values of dp (down to zero for S > 102.9 nm). Nevertheless, the tail of the 
distribution substantially grows, thereby revealing the occurrence of large 
surface FE. We have found values of up to ~ 10^, not only for the Ag 
fractal surfaces used in Fig. 6, but also for Au and Cu surfaces with analogous 
roughness parameters; the average surface FE’s, however, lie well below (ab- 
out 2 orders of magnitude). Interestingly, we have not gone further up in the 
increase of S for the sake of computation constraints (namely, large number of 
points N required to accurately sample such fractals with steep slopes) . With 
increasing roughness, the onset of the Anderson localization regime [27,28], 
owing to the strong SPP interference induced by the randomness, also helps 
to reinforce the surface FE. Anyhow, it should be mentioned that the SPP 
reradiation mechanism beyond some value of S prevents dp from becoming 
arbitrarily large in this configuration [28]. This is what the results shown in 
Fig. 6 seem to indicate, for the tail growth saturates beyond S = 102.9 nm. 

In light of the results of Fig. 6, it is tempting to say that large dp would 
also appear for the surfaces with small D by sufficiently raising <5. We have 
verified that this is not the case by calculating p(dp) as in Fig. 5(b) with 
D = 1.2 but for S = 514.5 nm (not shown here). In spite of the strong rms 
height of this fractal surface (formally equivalent to a large overall scattering 
potential), the fact that the relative weight in the surface power spectrum 
of the higher spatial frequency components with respect to the lower ones is 
preserved (for it only depends on D) hinders the excitation by light of SPP 
(and thus the related FE) for D < 1.5. It is worthwhile mentioning, however, 
that a weak FE not connected to the SPP excitation can be observed within 
the selvedge of this kind of surfaces when the rms height is high enough as 
to ensure the occurrence of double (or higher) scattering processes in the 
geometrical optics sense; this has been addressed in [28]. This kind of weak 
FE, which we believe irrelevant to SERS in most widely used experimental 
configurations, is also found for s-polarized waves. In this regard, we would 
like to point out that p(ds), although not shown here, resembles for all the 
fractal surface parameters considered thus far the narrow distribution of p(dp) 
for D = 1 shown in Fig. 5(a), confirming that no significant FE takes place 
for s polarization (transverse electric modes). 



5.2 Excitation Frequency: Ag, Au, Cu 

The wavelength dependence of p{dp{uj)) is studied in Fig. 7 for D = 1.9 
and S = 102.9 nm for the excitation wavelengths A(nm) =413.3, 514.5, 619.9, 
729.3, 826.6, and 1064 (where the experimental values of the Ag dielectric fun- 
ction at such wavelengths are obtained from [32]). When the wavelength di- 
minishes, the maximum probability value decreases approximately following 
the Fresnel coefficient dependence at normal incidence [cf. (11)]. At the same 
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Fig. 7. PDF of the p-polarized FE fac- 
tor p(o-p(o))) in a log scale for Ag fractal 
surfaces with D = 1.9 and 5=102.9 nm: 
A(nm) =413.3, 514.5, 619.9, 729.3, 826.6, 
and 1064 (correspondingly, = —4.41 -|- 
iO.73, -9.4-f iO.8, -15.04 -f il.02, -22.45 -f 
il.4, —30.2 -I- il. 6, and —48.8 -I- i3. 16). Other 
parameters as in Fig. 5. 




Fig. 8. Spectral dependence of (a) mean FE 
and (b) absorption for the same fractal sur- 
face used in Fig. 7. (-I-), Ag; (x). An; (o), 
Cu. 



time, the distribution widens for lower wavelengths, indicating that the frac- 
tal surface scatters light relatively more strongly. This stems from two facts: 
on the one hand, the Ag surface-plasmon wavelength \sp = 2^/^Ap « 222.2 
nm is being approached [12]; on the other, the roughness parameters in terms 
of the excitation wavelength become larger for smaller wavelengths. With re- 
spect to the occurrence of large FE, we observe in Fig. 7 that the tail of large 
(7p(oj) is built up with decreasing wavelength down to A = 619.9 nm, as so- 
mehow expected according to the preceding argument, but becomes smaller 
again upon further lowering the excitation wavelength to A = 413.3 nm. 

This is more clearly seen in Fig. 8(a), which shows the wavelength de- 
pendence of the ensemble average value of ap for Ag, Au, and Cu. The ex- 
perimental values of the dielectric functions for Au and Cu are also obtained 
from [32]. The behavior for the lower wavelengths is due to the increase of 
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Fig. 9. PDF of the p-polarized FE factor 
p{ap{u})) in a log scale for Ag, Au, and Cu 
fractal surfaces with D — 1.9 and 5=308.7 
nm at the optimum excitation frequency. 
Solid curve: Ag at A =619.9 nm; dashed 
curve: Au at A =729.3 nm; long-dashed 
curve: Cu at A =729.3 nm. Correspondingly, 
e<^ = -15.04 -f il.02, = -18.9 -f il.43, 

= — 19.6-|-il.98. Other parameters as in 
Fig. 5. 



the bulk absorptive losses of propagating SPP when approaching the surface- 
plasmon wavelength. This increase in the absorption is strengthened for Au 
and Cu, since the onset of interband transitions for these two metals, which 
contribute to raise considerably the imaginary part of the dielectric function 
[32], is reached for the wavelength A « 500 nm, in contrast to the case of 
Ag, for which this onset does not appear within the frequency range covered 
by Fig. 8. For large wavelengths, (cTp) behaves similarly for the three me- 
tals, diminishing as the ratio of the roughness parameters to the incident 
wavelength also decreases (namely, the incident radiation tends to “see” a 
smoother surface). Therefore, Fig. 8 indicates that there exists an optimum 
excitation wavelength, about A = 619.9 nm for Ag and A = 729.3 nm for Au 
and Cu, at which the maximum ap is found for the kind of fractal surfaces 
dealt with in this work. 

In Fig. 8(b) the absorption spectrum derived from the average reflectance 
calculations upon exploiting energy conservation, is shown for the fractal 
surfaces used in Fig. 8(a). Interestingly enough, in our case the maximum in 
the absorption spectrum does not necessarily correlate to the maximum FE, 
for there are no narrow resonances. As a matter of fact, it should be mentioned 
that the preceding argument is valid as long as no other frequency-selective 
mechanisms are present, such as localized surface plasmons associated with 
EM surface-shape resonances [6] or even electronic transition resonances [9]. 

Finally, we compare in Fig. 9 the p(cTp) for Ag, Au, and Cu, at the cor- 
responding optimum excitation wavelengths [see Fig. 8(a)], in the case of a 
fractal surface characterized hy D = 1.9 and S = 308.7 nm. The three curves 
lie practically on top of each other, revealing almost identical behavior for 
these metal surfaces provided that the optimum excitation frequency is used. 
Again, the latter statement is only true in the absence of narrow resonances. 
A resonance lying within the range of interband transitions for Au and Cu, 
but outside the range for Ag, would lead to a much lower FE for Au and Cu 
than for Ag, owing to damping. In general, the fact that the onset of such 
transitions in Ag occurs at a wavelength A « 300 nm, favors its use as a 
SERS substrate. 
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Fig. 10. Angular distribution of the speckle 
pattern intensity scattered from the frac- 
tal profiles shown in Fig. 2 for p polariza- 
tion, A = 514.5 nm, 6o = 30°, Wj\ = 
10 cos 00, Lj\ = 40, and N = 800: D = 
1, 1.2, 1.5, 1.8, 1.9, and 1.99. The dielectric 
permittivity of silver is: e = —9.4 -|- iO.8. 



6 Far Field 

The angular distribution of far field scattered into the vacuum half-space 
above the interface is not typically studied in SERS calculations, since most 
experimental works measure the SERS signal either at a fixed scattering 
angle or by collecting all angles with a focusing lens. Nevertheless, the com- 
plete description of the far field is a classical problem by itself merely from 
the scattering theory, and might help to deepen our understanding in SERS. 
It can be calculated from (9) once the two source-like surface functions are 
obtained as mentioned in Sect. 2. Particularly, we study the angular distribu- 
tion of mean scattered intensity (MSI) by means of Monte Carlo simulation 
calculations for different fractal dimensions. 

The angular distribution of the speckle pattern intensities scattered from 
the fractal silver surfaces plotted in Fig. 2 is shown for p polarization in 
Fig. 10 for an angle of incidence 9q = 30°. Leaving aside the specular peak 
(which is not suppressed for the value of the surface rms height 6/X considered 
in Fig. 10), there exist differences between the speckle patterns for different 
fractal dimensions. Away from the specular direction, upon increasing the 
fractal dimension, the speckle intensity for both polarizations builds up and 
its oscillations become broader and less pronounced, manifesting the gradual 
onset of the higher spatial frequencies relative to the lower ones. 

To analyze the differences observed in the averaged properties of these 
speckle patterns, the angular distribution of the diffuse component of the 
MSI is shown in Fig. 11 for do = 0° and 30° for p polarization. Despite that, 
of course, the specular component is absent in Fig. 11, it is seen therein that 
a diffuse peak appears at the specular direction in all cases. This specular 
peak is higher and narrower the smaller the fractal dimension D is. However, 
the total diffusely scattered energy (integrated over all scattering angles) 
keeps constant regardless of the value of D. On the other hand, it should 
be emphasized that the p-polarized component is more strongly scattered 
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Fig. 11. Angular distribution of the diffuse 
component of the mean scattered intensity 
from silver, self-afHne fractal surfaces for p 
polarization, A = 514.5 nm, S/\ = 0.05, 
W/\ = 10 cos 00, L/\ = 40, and N = 800. 
Average over Nr = 400 realizations. Se- 
veral fractal dimensions are considered: D = 
1, 1.2, 1.5, 1.8, 1.9, and 1.99. (a) 9q = 0°, (b) 
9q = 30°. The dielectric permittivity of silver 
is: e = —9.4 -|- iO.8. 



and gives rise to a broader angular distribution of MSI than the s-polarized 
component (not shown here, cf. [25]). 

The results of Fig. 11 can be qualitatively interpreted in light of the power 
spectrum of the fractal surfaces g{\k\) {k being the spatial frequency), which 
accounts for the contribution to the diffuse MSI to lowest (second) order in 
the surface height through g((u;/c) | sin 0^ — sin^sj). In the case of the self- 
affine fractal surfaces we are considering here, the power spectrum is a power 
law with a negative exponent whose absolute value decreases with increasing 
fractal dimension [31]. This indicates that the larger the fractal dimension 
D is, the broader the angular distribution of {n{9s))diff should be, as the 
curves in Fig. 11 reveal. The success of this simple argument suggests that 
a small-amplitude perturbation theory of the diffusely scattered intensity 
would suffice to describe, even quantitatively, the results shown in Fig. 11, 
considering how small the parameter S/X is. This might be the case for the 
lower fractal dimensions D < 1.5. Nonetheless, it does not apply to the 
surfaces with higher fractal dimensions. 

The presence of higher spectral components in the surfaces with larger 
fractal dimensions should enhance the excitation of surface-plasmon polari- 
tons in p polarization, as inferred from the behavior of the surface magnetic 
field and the occurrence of large field enhancements (see Sect. 4 above). This 
would lead to the appearance of enhanced backscattering in the diffusely 
scattered intensity, resulting from multiple scattering interference mediated 
by the excited surface-plasmon polaritons. As expected, the results for p po- 
larization and 6*0 = 20° shown in Fig. 12 for a fractal profile oi D = 1.95 and 
S/X = 0.2 exhibit a weak backscattering peak; this peak is absent from the 
result for s polarization. 
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Fig. 12. Same as Fig. 11(a) but for D — 1.95, 9q — 20°, 
5/\ = 0.2, and Nr = 500. Solid curve: p polarization; 
Dashed curve: s polarization. 



7 Concluding Remarks 

We have seen that the rigorous scattering theory formulation based on the 
exact Green’s theorem integral equations can be successfully applied to ac- 
count in the full electrodynamic sense for the near EM field found on a corru- 
gated metal surface illuminated by an incoming monochromatic beam. The 
occurrence of large FE in the vicinity of self-affine fractal surfaces of Ag, Au, 
and Cu has been explored through numerical simulation calculations of both 
the near field pattern and the PDF of surface field enhancements _p(ct/ 3 ). This 
is important to the understanding of the SERS EM mechanism on substrates 
consisting of fractally corrugated metal surfaces. 

The near field intensity contour maps give a rigorous picture of the EM 
scattering mechanisms in the vicinity of the vacuum/metal interface. The 
behavior of the non-specular component of the scattered field intensity as a 
function of the spatial coordinates indicates for p polarization the existence of 
forward and backward propagating SPP along the interface whose amplitude 
decays exponentially away from the mean surface profile. This in turn reveals 
that the roughness-induced excitation of propagating, p-polarized SPP lies 
on the basis of the occurrence of large FE relevant to SERS in this confi- 
guration. The calculated PDF of the surface FE accounts for the transition 
from a narrow distribution centered about the planar surface FE [cf. (11)] for 
weakly scattering surfaces, to a broad, slowly decaying function when large 
Up takes place. In addition, the Upiuj) for a given frequency increases with 
the local fractal dimension and with the rms height. Fractality seems to be 
more critical: no significant surface FE are encountered for small local fractal 
dimensions regardless of the rms height, whereas for large D even a very low 
surface roughness 5/\ ~ 0.05 suffices to produce moderate FE. 

The frequency dependence analysis of the PDF of surface FE manifests the 
existence of an excitation wavelength for which the surface FE is maximized. 
This optimum excitation wavelength observed for large fractal dimensions 
(at about A « 620 nm for Ag, A « 730 nm for Au and Cu) stems from a 
compromise between the fractal scatterer strength (given by the roughness 
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parameters normalized by the incident wavelength) responsible for light-SPP 
coupling, and the SPP absorptive losses within the bulk metal. In addition, 
it has been shown that the optimum wavelength does not coincide with that 
at which the absorption spectrum yields a maximum. (Recall also that no 
frequency resonances are present here.) All Ag, Au and Cu surfaces mani- 
fest similar abilities to produce large PE’s provided that the corresponding 
optimum excitation wavelength is used. 

Large local FE maxima have been found {ap < 10^, so that « 10®); 

however the average values are relatively modest. Leaving aside the existence 
of other non purely EM mechanisms, we believe this to be due to the choice 
of fractal surface model, which nonetheless has proven to be very useful for 
both understanding the EM processes being involved and quantitatively esta- 
blishing rigorous criteria as to the role played by surface fractality. The EM 
scattering formulation presented here could be used to study other surface 
models and even adapted to take into account more complicated configura- 
tions. 

On the other hand, the angular distribution of the diffuse component of 
mean scattering intensity exhibit a specular peak: The larger is the fractal 
dimension, the broader and smaller this peak becomes. This is qualitatively 
explained in terms of the power spectrum of the surface roughness (which 
partially accounts in turn for the lowest order contribution in the surface 
height to the perturbative expansion of the MSI), given by inverse power 
laws with a slower decay rate for the self-affine fractals with higher fractal 
dimensions. In addition, a small backscattering peak in p polarization is found 
for fractal surfaces with large fractal dimension and sufficiently large rms 
height (but smaller than the wavelength), as a manifestation of the weak 
localization of surface-plasmon polaritons. 



8 Perspectives 

One of the limitations of the analysis of the surface FE shown in Sect. 5 is the 
fact that it is based on the magnetic field calculations. Although this is stron- 
gly recommended for such one-dimensional geometries and linear transversal- 
magnetic polarization from the theoretical and computational standpoints, it 
is nonetheless true that the physical quantity that the molecule sees, entering 
into the Raman polarization vector scaled by its Raman polarizability, is the 
electric field. Though the FE calculations based on the magnetic field are 
inherently related to those based on the electric field, information on pola- 
rization is surely lost. We have put forth a simple method to calculate the 
normal and tangential components of the electric field [33] that permits to 
address in detail the surface electric FE along preferred directions, relevant 
to SERS selectivity rules. 

In addition, rigorous models describing the scattering of the Raman sig- 
nal coming from the molecule and detected in the far field, and thus the 
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corresponding Raman FE, should follow the present work. Dipoles standing 
for single-molecule vibrating modes have been widely used in the literature, 
though typically nearby simplistic metal surfaces. This has to be done for 
realistic metal surfaces as the those dealt with here. Other approach we are 
working on consists of a surface Raman polarization discontinuity [34] that 
could be applied to Langmuir-Blodgett films [9] . 

In spite that SERS spectroscopy is a well established technique, the afo- 
rementioned theoretical works might shed light into it, particularly bearing 
in mind the recent experimental work reporting on SERS single-molecule 
detection through near-held optical microscopy and claiming the existence 
of anomalously large SERS FE factors [35]. Furthermore, new interesting 
phenomena and/or experimental conhgurations can be predicted from such 
studies. 
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Abstract. Nonintrusive optical techniques for accurate sizing of discrete particles 
(or bubbles) suspended in continuous media have been studied for years and have 
reached, for some applications, an advanced stage of development. In contrast, a 
related problem of considerable importance that has received less attention is that 
of accurate characterization of fine particles or other features/defects on and under 
surfaces. This capability is much needed in the helds of semiconductor and inte- 
grated circuit fabrication, digital storage media manufacturing, and for research on 
contamination of optical components for both earth- and space-based applications. 
In this presentation, we show the development of theoretical based scattering mo- 
deling and comparisons between our experimental and theoretical studies of light 
scattering by features on and under surfaces relevant to semiconductor wafer inspec- 
tion. The experimental work shown here involves measurements of angle-resolved 
light scattering signatures (for 64 scattering angles) from an individual spherical 
particle 0.482 fim in diameter on a bare Si wafer. Experiments and predictions show 
good agreement, and the models are being used in predicting the performance of 
wafer scanners for various experimental conditions and for assisting in the design 
of future-generation surface defect characterization instruments. 



1 Introduction 

Integrated circuit chips have an ever increasing effect on almost every aspect 
of life, and technology continues to dictate that the smallest, or critical, di- 
mensions of these IC’s reach into the smaller and smaller regions of the nano- 
scale. This trend presents the semiconductor industry a manufacturing chal- 
lenge to produce smaller and more densely packed integrated circuitry. With 
smaller circuitry, particle contamination and manufacturing defects must be 
more closely controlled to avoid loss in production yield. The challenge to 
the manufacturer is to produce chips in a contamination-free environment 
with as few defects as possible. This increasing emphasis on improving chip 
manufacturing yield is very critical since the amount of capital investment 
involved is immense and competition is fierce. 

With the increasing number of devices, the smallest dimension of leading- 
edge production devices has decreased to well below 0.2 /im. A rule-of-thumb 
in the industry states that a ’’killer” contaminant which may render a chip 



F. Moreno and F. Gonzalez (Eds.): Lectures 1998, LNP 534, pp. 237-252, 2000. 
© Springer-Verlag Berlin Heidelberg 2000 




238 



B.M. Nebeker and E.D. Hirleman 



useless is one-third to one-half the size of the chip critical dimension. [1] The 
critical dimension of a ’’killer” defect is projected to decrease to 0.06 ^m in 
2001 and 0.02 ^m in 2006. [1] 

It is imperative to detect particle contamination and defects that appear 
during the manufacturing process. Contamination may come from equipment, 
non-clean environments, and the workers themselves. Defects and contamina- 
tion also result from unsatisfactory photolithography and etching processes 
which involve the addition or removal of material on the chips. With pro- 
per detection of contamination and defects, manufacturing problems can be 
isolated and their causes corrected quickly. Failure to detect defects and con- 
tamination may result in significant losses in production yield and profits. 
Surface scanning inspection systems (SSIS) are used to detect particle con- 
tamination and manufacturing defects. Generally, to detect surface features, 
these inspection systems measure scattered light from laser beams that are in- 
cident upon wafer substrates. Information about the features can be deduced 
from the pattern and magnitude of the scattered light irradiance. The size, 
shape, and material of a scatterer as well as the wavelength and polarization 
of the incident beam affect the characteristics of the scattered light. 

Increasing demand for more accurate and sensitive detection systems cau- 
sed by the decreasing critical IC dimensions has led to the development of 
new, non-intrusive detection systems. Development of new systems requires 
large investments of capital, thus careful efforts in the design process must 
be taken to ensure good return on investment. Development of new systems 
can be approached by two methods. In one method, careful light scattering 
experiments may be conducted in a laboratory environment to determine 
scattering characteristics from semiconductor wafer defects and the require- 
ments for optimum detector design. The advantage of this approach is that 
it provides a physical environment that may be similar to the actual manu- 
facturing environment, and these conditions can be quantified as part of the 
experiment. The disadvantages of this approach are the high costs for careful 
experiments, and the limitations to provide results that encompass the ent- 
ire range of features, defects, substrates, and detector systems that exist in 
manufacturing. For this reason, an alternative approach for inspection tool 
design is to develop software to model the light scattering and detection sy- 
stems. The advantages of this approach are that a wide range of variables 
can be studied quickly and a smaller amount of capital investment is usually 
needed as compared to experiment. The software could be used by a large 
number of modelers resulting in wider range of results than could possibly 
be conducted under experimental conditions. The disadvantages of this ap- 
proach include the difficulty to determine the model accuracy and the ability 
to account for unknown parameters which may exist in the manufacturing 
environment. The best approach for the design of detection systems is to 
utilize the experimental and numerical approaches in conjunction with each 
other. If conducted simultaneously, experiments can be used to quantify the 
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accuracy of the numerical codes, and the numerical codes can be used to 
determine the accuracy of the experiments conducted. 

In this study, we will first provide a background into the study of scatte- 
ring from features on surfaces. A review of numerical methods used to predict 
scattering from features is given, and a history of scattering experiments for 
features on surfaces will be given. Next, development of the discrete-dipole 
approximation (DDA) method to model light scattering from features on 
surfaces is described. Predictions of this method will then be compared to 
scattering experiments performed at Arizona State University on features 
found on semiconductor wafers. 

2 Literature Survey 

This section provides a survey of literature associated with numerical mode- 
ling of light scattering from features on surfaces and experimental procedures 
used to measure scattering. 



2.1 Numerical Modeling 

The road leading to the development of numerical methods to model electro- 
magnetic scattering from a wide range of features of any shape has been paved 
by elementary methods developed for specific cases, but whose accuracy and 
range of applicability are too limited for the needs of the semiconductor in- 
dustry. Early models investigated scattering from isolated particles with no 
surface interaction. As early as the turn of this century, scientists investiga- 
ted light scattering from dust particles to explain their effect on the color of 
the sky. Lorenz in 1890 [2] and Mie in 1908 [3] independently developed ana- 
lytical methods based on Maxwell’s equations to determine light scattering 
by a sphere in the presence of an electromagnetic plane wave. The analyti- 
cal methods employed the Bessel and Hankel functions, which were difficult 
to compute accurately before the use of computers as mathematical tools 
became practical during the 1960’s. 

The advent of powerful computers allowed rigorous modeling of Maxwell’s 
equations to be possible. Peltoniemi [4] provides a detailed discussion of nu- 
merical approaches for the electromagnetic scattering problem. Most nume- 
rical techniques involve the discretization of the scattering volume, applying 
Maxwell’s equations to each sub- volume of the scatterer, creating a set of 
algebraic equations. One family of these discretization numerical methods 
include the finite-element and finite-difference methods. When applied to the 
Maxwell equations, very large but sparse matrices are solved [5,6]. While the 
sparseness of the matrices provides a modeling advantage, disadvantages lie 
in the difficulty to determine and apply correct boundary conditions for the 
problem. Non-standard procedures are often needed for these methods, and 
these procedures may lead to uncertainty of the modeling accuracy [7]. 
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Another Maxwell equation based model is the volume integral equation 
technique [4]. This technique includes a family of closely related methods 
where the volume of the scatterer is divided into a number of sub-volumes 
approximated with uniform internal fields where point matching techniques 
are used to solve for the fields within these sub- volumes. Several methods ba- 
sed on this technique are known as the volume integral equation formulation 
(VIEF) method [8], the method of moments (MoM) [9,10], the digitized 
Green’s function method (DGFM) [11], and the discrete-dipole approxima- 
tion (DDA) method, also known as the coupled-dipole method [12,13]. The 
cells of the scatterer created by these methods are considered as dipoles. 
In these methods, a number of dipoles are positioned in a lattice to mo- 
del the volume, shape, and composition of a scattering feature. The electric 
field at each dipole includes contributions from the incident beam field, the 
field reflected from the surface, and the field created by superposition of the 
fields from each dipole on the lattice. The solution procedure determines the 
electric field at each point in the lattice. Basically, the only mathematical dif- 
ference between the methods of the volume integral equation family is in the 
way by which the matrix diagonal elements, called the dipole polarizabilities 
in the DDA method, are approximated. Peltoniemi describes the differences 
among the volume integral equation methods that, during their development, 
the DDA methods emphasize the role of dipoles as radiating physical units, 
while the MoM/VIEF/DGFM methods consider dipoles as abstract mathe- 
matical tools without any deeper meaning, replaceable with anything which 
appears better in terms of modeling [4] . For a more detailed discussion of the 
relationship between the DDA method and the MoM method, refer to the 
works of Lakhtakia [14,15]. 

The progression of light scattering modeling from features in free-space 
to features in the presence of an interacting surface leads to more detailed 
and complicated models. The interacting surface complicates the scattering 
problem, as the surface boundary conditions modify the scattered field cha- 
racteristics. Development of models for features in the presence of interacting 
surfaces followed the same path taken by free-space modeling - first, restric- 
tive analytical models were developed, and as the study of light scattering 
progressed, methods based more rigorously on Maxwell’s equations were de- 
veloped. Some of the first models for features on surfaces were based on the 
Lorenz-Mie theory for light scattering from spherical features with flat sur- 
face interaction using plane- wave expansions of the scattered field [16,17]. 
Particle contaminants on wafer substrates in the semiconductor industry ge- 
nerally are not spherical, so the Lorenz-Mie formulation may not provide 
predictions with acceptable accuracy. Another method to compute scatte- 
ring from features on surfaces uses the exact image theory to model features 
above a perfect conductor [18,19]. Most substrates in the semiconductor indu- 
stry are not perfect conductors, and the features were confined to be spherical 
when using this theory, so other approaches need to be considered. In 1995, 




13 Semiconductor Wafer Inspection 241 



Videen[20] developed a model for arbitrarily shaped features above a conduc- 
ting surface. Wojcik et al. [21] and Liswith [22] have used the time-domain 
finite-element method for Maxwell’s equations to model scattering from fea- 
tures on surfaces. Along with discretizing the scatterer when developing the 
model, the interacting surface is discretized as part of the surrounding me- 
dium. With this method, application of the boundary conditions can be very 
difficult and the computational requirements are significant. 

Several methods of the volume integral equation technique have been 
applied to the electromagnetic scattering problem with surface interaction. 
One of the most recent models developed is called the discrete-source method 
(DSM) which entails the use of linear combinations of fields from dipoles and 
multipoles to solve Maxwell’s equations and the boundary conditions [23]. 
Recently Ivakhnenko and Eremin have proposed a VIEF model for features 
above surfaces [24]. 

The numerical method discussed in detail for this study is the discrete- 
dipole approximation (DDA) when the surface boundary interaction is ap- 
plied. Taubenblatt [25,26] modified the free-space DDA method by employing 
the Sommerfeld integrals for dipole radiation above an interacting surface. At 
Arizona State University, Schmehl [27] and Nebeker [28] developed a DDA 
model with surface interaction (DDSURF) inspired by the free-space DDA 
code developed by Draine and Flatau [29] (DDSCAT). A two-dimensional 
Fast Fourier Transform was introduced to reduce the number of computatio- 
nal operations required to solve the matrix equations of the model [30]. A 
post-processor (RDSURF) was developed by Nebeker et al. [31] which used 
the computed internal scatterer field to compute the far-held scattering. The 
development of DDSURF has been conducted concurrently with experiments 
conducted at ASU and comparisons have been made between experiment and 
numerical prediction [31-33]. 

2.2 Experimental Measurements 

Weber conducted some of the hrst careful experimental measurements of 
angle-resolved scattering by individual particles on bare surfaces [34]. Light 
scattering from polystyrene latex (PSL) spheres on a nickel substrate was 
measured. Weber compared the experimental results to a model that used 
the Lorenz-Mie theory for a spherical particle above a surface with angle re- 
solved measurements of light scattered from different sized PSL spheres on 
silicon substrates. Bawolek [35] extended the work of Weber by investigating 
other sizes of PSL spheres, and different substrate materials. Warner [36] 
continued the work using the Arizona State University Scatterometer and 
investigated the effect of feature refractive index on the far-held scattering 
by considering diherent scatterers. This was an important extension to the 
work by Bawolek [34] because, in the semiconductor industry, most detection 
systems are calibrated using standard PSL spheres, but since most conta- 
mination during the semiconductor manufacturing process is not from PSL 
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features. Contamination may result from features composed of aluminum, 
silicon dioxide, tungsten, etc., and calibration to determine contaminant size 
based on PSL spheres may give misleading results. The materials that War- 
ner studied included silicon dioxide, tungsten, aluminum and silicon. Starr 
[37] brought this work to fruition with the investigation of light scattering 
from features more representative of what is found in the semiconductor in- 
dustry. These features composed of silicon dioxide material on silicon wafers 
were present on the Arizona State University/Semiconductor Research Cor- 
poration (ASU/SRC) section of the SEMATECH Patterned Wafer Defect 
Standard (PWDS) developed by Scheer of VLSI Standards, Inc. [31-33,38]. 
The structures were simple enough for accurate characterization by numeri- 
cal scattering models, and generally represented structures typically found in 
the semiconductor industry. 

3 The Discrete-Dipole Approximation Method 

In this section, we discuss the development of the discrete-dipole approxima- 
tion (DDA) method used here to model the scattering from features. In the 
DDA method, a lattice of dipoles is used to model features present on or near 
a surface. The feature shape is modeled by the arrangement of the dipoles, 
and the physical properties of the feature are modeled by designating the 
polarizeability of each dipole. Figure 1 shows a dipole lattice configuration 
for a spherical particle on a surface. 




Fig. 1. Dipole configuration for a spherical particle on a surface. 



3.1 Internal Field Computation 

When an electrical field is applied to the dipoles, dipole moments are induced 
at each dipole. At a dipole i, the dipole moment Pi is related to the total 
electrical field Etot,iby the dipole polarizeability through 
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Recently, considerable effort has been taken in the development of the 
polarizeability model used for the DDA. Draine and Goodman [39] discuss 
several models. The lattice dispersion relation (LDR) is the version which 
has been most recommended for accuracy. 

The total electrical field at the dipole i, Etot,i, is the summation of the 
incident field - including the direct beam field and the beam field reflected 
from the surface, Einc,i, the field by direct interaction between the dipoles 
in the lattice, and the dipole fields reflected from the surface, E^ef^i- 

Substituting the summation of the fields into 1 and rearranging, we get 



(o^z) pt dir,i ~^refd — inc,i (2) 

The electrical field by direct interaction between the dipoles, Edir,ii can 
be represented as 



1.2 " 

(3) 

where Gij is the dyadic Green’s function [30], and N is the total number of 
dipoles in the lattice. 

The electrical field interaction between dipoles with reflection from the 
surface, Eref.i, can be represented as 




The components of the 3a;3 matrix are given by the Sommerfeld integral 
terms as described in Nebeker [28] . The Sommerfeld integral terms are com- 
puted numerically using routines developed by Lager and Lytle, [38] where 
ki&nd k 2 are the wave numbers in the above surface medium and substrate 
medium, and So is the permittivity of the medium surrounding the scatterer. 
Gjj is the image dyadic Green’s function. 

It is possible to represent the summation equations in terms of matrix 
equations consisting of N dipoles. The matrix equation for the entire problem 
can be represented as 

(B + A + R)P = (5) 

For N total dipoles, B is the diagonal matrix of the inverse of the pola- 
rizabilities 



B = diag{a 
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and A is the matrix that corresponds to (4) which is for direct interaction 
between the dipoles and i? is the matrix that corresponds to (4) for the dipole 
reflected field interaction. A and R each consists of 3x3 submatrices 
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where A^j and Rij are submatrices representing each dipole-dipole interaction 
Green’s functions. These submatrices are described in Schmehl et. al [30]. P, 
the dipole moment vector, and Ei„c, incident held vector, are composed of 
N 1x3 vectors: 



E — [Pi---Pn] ; E^nc — ]-^inc,l----^mc,Ar] (8) 

where Pi is the dipole moment vector at dipole i, and Einc,i is the incident 
held vector at dipole i. 

The solution procedure needed to solve for P in (5) usually requires an 
iterative method. Direct inversion of the matrix (^B + A + R) to solve (5) is 
impractical if large number of dipoles are required to model the scattering 
feature. A major concern when using the DDA method is the iterative proce- 
dure used to solve the system of dipole equations. Draine [13] used a complex 
conjugate gradient {CCG) method to solve the system of equations for a free 
space scattering features. The range in which accurate convergence to a solu- 
tion could be obtained was determined to be a function of the relation of the 
dipole spacing used to the wavelength of the incident electromagnetic wave 
within the scatterer. Thus, as the refractive index of a scatterer increased, 
decreasing the wavelength within the scatterer, the dipole spacing must be 
decreased. With the introduction of a scattering surface, the condition of the 
governing matrix relation is modified and convergence using a general CCG 
method developed for symmetric matrices may not be achieved. It became 
necessary to And other iterative procedures that allowed for convergence to 
be achieved more readily, even as the condition becomes worse and the matrix 
becomes less symmetric. Nebeker et. al [41] studied of several methods for the 
DDA method with surface interaction. It was found that the quasi-minimal 
residual {QMR) method of Freund [42] provided better convergence charac- 
teristics when compared to the CCG method and the bi-conjugate gradient 
(BCG) method. 

3.2 Far-Field Computation 

The dipole moment distribution within the scattering feature, P, is used to 
determine the scattering response in the far-fleld. In this section, we show how 
to compute the scattered held from dipoles above a surface by separating the 
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Fig. 2. Geometrical configuration for far-field scattering. 



scattering into two components: the field direct from the dipole to the point 
of observation, and the field from the dipole to the point of observation which 
is reflected from the surface. 

The basis of this analysis is that the point of detection of the scattered 
field is in the far-held region. The far-field can be defined by the following 
conditions: 



\ri\»\rj\ and ^2 jr^l >> 1 (9) 

where Vj is the radiating dipole position vector, and is the point of obser- 
vation. Equation (9) describes the far-field at is at a distance far from the 
scatterer (at r) when compared to the scattered field wavelength. 

Application of these far-field limitations to the Maxwell equations leads 
to the relation for the direct scattered field from N dipoles to the far field at 

Vi'. 



sca,dir — ^2 4 ^ ^ i^sca'^ j) [d j (1^) 

J = 1 

The dyad vector Crir projects the vector pj in the direction perpendicular 
to Cr- These projections can be represented in spherical 9 and <j) components, 
or they can be resolved in the scattering frame (61,62,63) shown in Fig. 2. 
When the scattering field of (10) is decomposed into the scattering frame, we 
get 



sea, dir (^i) — 
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For the field components that have been refiected from the surface, the 
far-field can be approximated using the reflection coefficient approximation 
{RCA), which models the scattered wave interaction with the surface by 
using the Fresnel reflection coefficients. The equation for this refiected field 
can be based on the equivalent of 11, and including the Fresnel coefficients is 



^ sca.ref i) — ^2 .1 

Anri 

N 

X ^ exp{-iki^sca ■ rj) [R'^^ {pj ■ ei) Ci + R^^ {pj ■ S2) 63] 
i=i 

( 12 ) 

where kj^sca = {ksca,x,ksca,y, ~ksca,z) IS the image scattering direction wave 
vector. R^^^and are the transverse magnetic {TM) and transverse el- 
ectric (TE) Fresnel reflection coefficients respectively. 

By summing the direct and refiected scattered far-field contributions, 
E sea, dir and Esca,refi from each of the N dipoles, the field at an observation 
point i is 



-f 

^ sea V i) — ^2 t 

Anri 
N 

sc j exp{-iksca ■ rj) \{pj ■ ei) ei -I- {pj ■ 63) 63] -I- 
^ \ exp(-tfe/,sco • rj) {pj ■ ei) ei -b {pj ■ 63) 63] 

(13) 



Experimental results for scattering from the features on surfaces are collec- 
ted in the form of differential scattering cross-section, dCsca/d^- Differential 
scattering cross-section is defined as the energy scattered per unit time per 
unit solid angle about a certain direction. Bohren and Huffman [44] derived 
an expression for this: 
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(14) 



where Isea is the scattered irradiance, A is the detection area, r is the distance 
from the feature to the observation point, Rne is the incident irradiance, and 
O is the detection solid angle. 
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Fig. 3. SEM photograph of a spherical particle on a substrate. 

4 Application of the DDA Method and Comparison 
with Experimental Results 

4.1 Feature Characterization/Dipole Moment Distribution 
Prediction 

In this section, we discuss the prediction of the differential scattering cross- 
section of a scattering spherical feature on a flat substrate shown in Fig. 3. 
We then compare the predictions to experimental results found at Arizona 
State University. The scattering feature considered is a 0.482 /im diameter 
polystyrene latex (PSL) sphere on a silicon surface. The polarized and mo- 
nochromatic illuminating beam is p - polarized (TM), has a wavelength of 
632.8 nm, and is directed at a 45° angle from the surface normal. The first 
step in the modeling process is to determine the lattice arrangement for the 
dipoles needed to represent the spherical feature. To accurately model the in- 
ternal field of a scatterer, an element (dipole) spacing of less than one-tenth 
of the internal field wavelength within the scatterer is required. For a 632.8 
nm wavelength, PSL has a refractive index of 1.59, therefore, to satisfy the 
modeling requirement, the dipole spacing should be less than 



dmax = O.IA/ \npart \ = 0-1 (0.6328/im) /1. 59 = 0.04/im (15) 

For the case considered, the x, y, and z axes of the sphere was filled 
out by 20 dipoles as shown in Fig. 1, thus a dipole spacing of 0.0241 pm 
was used, significantly smaller than the maximum allowable spacing. Fewer 
dipoles could be used to satisfy the dipole spacing requirements, but the 
larger number allows a more accurate approximation of the spherical shape 
when using the rectangular lattice array spacing. 

To recap, the input parameters required to arrange the dipole lattice 
include the feature size, refractive index, and incident beam wavelength. Once 
the dipole lattice has been arranged, the surface can be characterized by 
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designating the refractive index of the surface material. The refractive index 
of a silicon substrate at a 632.8 nm wavelength is (3.88, 0.023). The incident 
beam is further characterized by indicating the polarization (j> or s), as well 
as the incident direction vector. Once these parameters are designated, the 
dipole matrix equation, (5), can be solved, predicting the internal dipole 
moment distribution of the scatterer, P, and then the far-held scattering 
quantities can be found. 



4.2 Far-Field Prediction 

Once the dipole moment distribution is found using an iterative matrix so- 
lution process, the scattered electric far-held can be found. The scattering 
feature - detector system conhguration must be known to solve (13). The 
scatterer - detector arrangement used for this study is shown in Figs. 4(a) 
and 4(b). The circular detector, approximately 1 inch in diameter, consists 
of 32 ring-shaped and 32 wedge-shaped photodetectors. The center of the 
concentric ring detectors are at angles ranging from 0.6° to 62.7° from the 
specular scattering direction, 45° from the surface normal vector. The wedge 
detectors range in the (p direction from —90° to 90° from the center. The 
center of the detector is placed 8.2 mm from the scattering feature. The 
diherent shaped detectors are used to determine different scattering charac- 
teristics. Results from the ring region provide information about the size of 
the scattering feature, and the wedge section results can provide information 
about the feature shape. 




Fig. 4. (a) Feature/detector configuration used to determine scattering characteri- 
stics (side view), (b) Ring/Wedge detector configuration. 



Computationally, the scattered parameters are determined at discrete 
points, Tj using (13), though the actual ring and wedge detectors occupy finite 
areas. To approximate the detector response, the electric field is computed 
at a number of points that lie within the detector region. These fields are 
then integrated throughout a single detector region (one ring or one wedge), 
and the integrated result is considered to be the equivalent predicted field 
measured by a detector. 
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Figures 5(a) and 5(b) show comparisons of the numerical predictions 
and experimental differential scattering cross-section results for the ring and 
wedge detection regions respectively. We see that the predictions and experi- 
mental results are in good agreement. In the ring region, the predictions are 
5% to 60% higher than experiment, while in the wedge region agreement bet- 
ween prediction and experiment are within 50%. In the semiconductor indu- 
stry, agreement within these ranges are considered to very good, considering 
all the factors which may cause variation between experiment conditions and 
conditions applied to the numerical model. Experimental uncertainties such 
as beam power, beam width, and detector/feature separation distance can 
account for the difference between experiment and prediction. For example, 
when a symmetrical scattering feature such as the sphere is considered, we 
would expect the wedge data to be symmetrical about the </> = 0° angle in 
Fig. 5(b). However, the experimental results are slightly non-symmetric ab- 
out (j) = Q° . One reason for the non-symmetry may be that the alignment 
of the photodetector was not exactly perpendicular to the plane of beam 
incidence. Other causes could be deviation of the sphere diameter from the 
specification of the manufacturer, and/or the presence of an oxide layer due 
to exposure of the surface to the environment. 



Ring Delector Re^on 





Fig. 5. Differential Scattering Cross-Section, comparisons between experimental re- 
sults and numerical predictions: (a) ringed region; (b) wedge region. Measurements 
by Starr [37]. 



5 Conclusions 

The objective of this work is to show that a numerical model based on the 
DDA method can accurately model scattering from features on surfaces. The 
models using the DDA method are being used in the semiconductor indu- 
stry to help in the development and calibration of surface scanning inspection 
systems used in the manufacturing of semiconductor products. Other compa- 
risons between experimental work and the numerical predictions of the DDA 
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method for features on surfaces have been made at Arizona State University. 
[28, 30, 31, 32, 33, 37] These references show that with careful experimental 
procedures and accurate modeling, the scattering characteristics of features 
on surfaces can be accurately predicted using the DDA method. 
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Abstract. Photonic crystals are formed of a regularly patterned microstructured 
material, with a pitch comparable to the optical wavelength. These materials exhi- 
bit unusual optical properties due to the interference of the light scattered at the 
different interfaces, and this has remarkable consequences such as the formation of 
photonic band gaps. One way of making 2-dimensional photonic crystal materials 
is to use the technology of optical fibre fabrication. We describe how new optical 
properties arise in such media, and how they can be used to form novel optical 
waveguide structures. 



1 Introduction 

Several existing and developing technologies are capable of producing 2- or 3- 
dimensionally periodic arrays of scatterers - artificial crystals with a pitch of 
the order of microns or less. Because in some ways the behaviour of photons 
in these materials is analogous to that of electrons in naturally-occurring 
crystals, such microstructured materials have become known as photonic 
crystal materials [1]. These materials enable new forms of optical elements 
(waveguides, filters, routers...) and could greatly increase the efficiency and 
speed of active optical devices. Consequently, they have the potential to have 
a substantial impact on optics and optoelectronics. The complexity of the 
interaction of electromagnetic waves with these materials is evidenced by the 
substantial world-wide research effort in this field over the past decade [2]. 
Laboratory tests at sub-optical frequencies have demonstrated the validity of 
the basic theories of electromagnetic waves in these structures, for example 
demonstrating that there can be ranges of frequency within which light is un- 
able to propagate inside the material. However, the relatively slow progress 
towards the demonstration of genuine device designs at optical frequencies is 
a reflection of the several remaining hurdles facing workers in the held. Some 
of these remaining challenges are: 1) the fabrication of samples with the re- 
quired pitch, uniformity and refractive index contrast, 2) designing devices 
in such a way as to avoid the problems due to diffraction of light from 2- 
dimensionally periodic structures, 3) providing the mechanisms for coupling 
efficiently into and out of the periodic regions, and 4) comprehensive ex- 
perimental characterisation of completed samples. This chapter describes an 
investigation into 2-dimensionally periodic materials being used in the design 
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and demonstration of new types of optical fibre waveguide [3-6]. In order to 
understand the optical properties of the new waveguides it is important to 
understand the nature of the effects at work in the photonic crystal material 
which will form the waveguide cladding. 

The analogy with the theory of electronic conduction in semiconductors 
provides a historical framework for our understanding of the interaction of 
light with a photonic crystal material. Specifically, just as a large number of 
atoms coupled together in a crystal lattice can give rise to electronic conduc- 
tion bands and band gaps, so in a 2-dimensional photonic crystal scattering 
from several surfaces within the material can cause ranges of frequency where 
electromagnetic propagation is not possible. A large refractive index contrast 
between the two phases of the microstructured material ensures a significant 
reflection at each interface, so that only a relatively small number of surfaces 
are required for strong reflection. This strengthens the effect and makes it 
possible to And ranges of /? - the component of propagation constant perpen- 
dicular to the periodic plane - for which there is a complete photonic band 
gap, independent of direction with respect to the crystal lattice [7]. Within 
this range of (3 there are no propagating modes, and the only allowed elec- 
tromagnetic fields are evanescent. This suggests that the optical properties 
of photonic crystal materials can be modelled as a multiple-scattering pro- 
blem [8], where the number of scatterers which one needs to consider might 
be relatively large but remains finite. It further suggests that scattering stu- 
dies would be a fruitful way of investigating the optical properties of these 
microstructured samples. 



2 Scattering in Periodically Microstructnred Materials 

Previous treatments of multiple scattering by cylinders or more usually sphe- 
res have concentrated on the scattering by a few particles which are arrayed in 
an irregular fashion, and which are illuminated from free space [9,10]. Con- 
vergent solutions have been obtained, and evaluated numerically for many 
simple configurations. Simple experiments have been performed to verify the 
performance of the scattering algorithms [11,12]. Such algorithms have several 
drawbacks as far as the present work is concerned: they converge extremely 
slowly for the relatively large number of scatterers involved in the scattering 
process and they do not take advantage of the periodic nature of the material 
structure to optimise the calculations. On the other hand the methods ori- 
ginally developed for studying the scattering of x-rays in naturally-occurring 
crystals have much to recommend them: the use of inverse-space diagrams 
(fc-vector diagrams) lends itself naturally to the solution of problems invol- 
ving periodic materials. Much has been made of the series of methods derived 
from the analysis of electronic properties of naturally-occurring crystals, the 
plane-wave method in particular [13]. Although these methods are admira- 
bly suited to the numerical analysis of continuous periodic materials they 
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do not in themselves provide solutions to the scattering problem, in which 
the interface with the modes of free space are of prime importance. ’’Scat- 
tering” in this context can be applied to two rather different situations: in 
a conventional sense (when the microstructured material is illuminated from 
free space by a propagating beam) and in a rather different sense where the 
interaction of the material itself with a time-harmonic electromagnetic field 
is modelled as being due to a continuous process of multiple scattering (e.g. 
using a Green’s function approach). In working towards our objective of wa- 
veguiding in a microstructured material by a photonic band gap effect, the 
second of these is the fundamental effect which we are interested in, while the 
first promises to be a useful characterisation tool in assessing our progress 
towards that goal. 

The waveguides to be discussed here are made by forming the required 
2-dimensionally periodic structure on a macroscopic scale in the form of a 
preform, and then reducing the cross-sectional scale of the structure by several 
orders of magnitude by drawing this preform down into a fine fibre on a fibre- 
drawing tower at an elevated temperature. The preform is made by stacking 
together in a close-packed arrangement hundreds of silica tubes and rods, 
each of which is typically 0.75 mm in diameter and 40 cm in length. The 
preform is then of the same length and perhaps 15 mm in diameter, while 
the final fibre into which the preform is drawn - which can be many kilometres 
in length - has a diameter in the range of 30 — 200 /im. Careful adjustment 
of the temperature at which the fibre is drawn allows one to maintain the 
periodic pattern of air holes in the cross-section of the final fibre, where the 
pitch of the ’’crystal lattice” is of the order of microns. The air holes , which 
can be as small as 50 nm in diameter, run down the entire length of fibre, 
giving an aspect ratio of the order of 10®. A schematic representation of a 
fibre, which we call a photonic crystal fibre (PCF), is shown in Fig. 1. 

Photonic crystal fibre differs from all other 2d photonic crystal materials 
in that it can be made in virtually limitless lengths. This is a major depar- 
ture: it means that PCF represents a truly three-dimensional material, with 
a periodic patterning in two of these dimensions. Thus in PCF we have the 
opportunity to investigate a two-dimensionally periodic material without in 
any way restricting interest to the periodic plane. One result of this is that 
it is possible to obtain band gaps for relatively small values of refractive 
index contrast - in-plane band gaps are not possible for the silica/ air ma- 
terial system being described here. This is due to the greater reflection at 
each interface which occurs for out-of-plane incidence - we have taken this 
effect to an extreme case in demonstrating photonic band gap effects in a 
honeycomb lattice structure, as described in Sect. 6. A second result of the 
three-dimensional nature of the fibre is that PCF represents an ideal way to 
fabricate useful photonic crystal devices for a three-dimensional world. Our 
major interest in this type of structures is exactly this: we aim to design, 
fabricate and demonstrate waveguiding optical fibres which are completely 
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Fig. 1. Schematic representation of a photonic crystal fibre. The arrows indicate 
the component of the wavevector k which is parallel to the fibre axis. 



different to conventional waveguides. Instead of using existing materials with 
different refractive indices to define the waveguiding structure, we have en- 
gineered new microstructured materials which have the desired properties. 



3 Scattering from Photonic Crystal Fibres 

Our interest in the scattering properties of photonic crystal fibres lies in the 
use of scattering patterns as an indicator of the band structure of the photonic 
crystal - not so much the common scattering processes (such as Fraunhofer or 
anomalous diffraction), but the multiple scattering process that takes place 
within the microstructured fibre. Thus, it is important that we investigate 
a suitable range of incident propagation constants - side-illumination of the 
fibre from air even at grazing-incidence (i.e. beam direction parallel to the 
fibre axis) can only give a maximum /3-value of the free-space wavevector 
[14,15]. However, our samples have an inter-hole spacing substantially greater 
than A/2n, so that the fundamental Bragg conditions lie beyond the critical 
angle for a silica/air interface. In order to access higher wavevectors f3 than 
k we have embedded fibre samples in an oil with a refractive index close 
to that of silica {us = 1.45), thus enabling /3- values up to Ugk for oblique 
illumination. This is done by mounting the fibre sample as shown in Fig. 2. 

A short length of fibre (typically 25 mm) is stoppered at both ends using 
glue and is mounted along the ax;is of a cylindrical cell, which is fitted with 
transparent windows at both ends. The cell is placed horizontally on rotation 
stages enabling it to be rotated about its own axis and also to be rotated 
in the horizontal plane. The fibre is then illuminated with a focussed laser 
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Fig. 2. Schematic diagram showing the experimental confignration nsed to record 
scattering patterns. Focussed light incident from the left enters the silica cell, and 
scatters off the photonic crystal fibre. The scattered cone of light falls onto a screen 
on the right hand side of the diagram, which is imaged onto a vidicon camera. The 
angle and the orientation of the fibre abont its own axis, can be varied. The 
nnscattered light falls on the screen at <j) = 0 



beam (A = 1.5 /xm, NA = 0.008) through one of the end windows, and 
the scattered light passes through the other window before falling onto a 
screen. Illumination at a single angle C thus gives rise to scattering into a 
cone, which appears as a ring on the screen [16]. This screen is imaged onto 
a vidicon camera and the video signal from the camera is acquired using a 
frame-grabber, enabling the acquisition of all the main features of the entire 
scattering pattern of a fibre at a particular angle C. 

For small air holes (as shown in Fig. 3), the object can be modelled as a 
coherent array of Rayleigh scatterers embedded in a silica matrix, giving rise 
to Bragg scattering at particular angles of incidence. 

Assuming the first Born approximation, the angular width of the Bragg 
peaks is determined roughly by the number of scatterers involved in the pro- 
cess. Because the silica matrix is index-matched to the oil in our experiment, 
there is virtually no scattering from the surface of the fibre in this case. 
Thus, the scattering is characterised by sharp features appearing for parti- 
cular angles C, with very little scattering at other angles. Most of the light 
passes through the sample without being scattered, at all incident angles. An 
example of a scattering pattern observed from a fibre with a triangular array 
of small air holes (with diameter d < 0.2 /xm spaced by A = 2.2 /xm) is shown 
in Fig. 4. The incident angle has been adjusted so as to coincide with one of 
the Bragg points of the fibre. The unscattered light causes the bright spot at 
(f) = 0°. The Bragg scattering angles for a fibre with a particular lattice type 
and pitch are readily predicted using the inverse lattice and the Brillouin 
zone boundaries. In Fig. 4b the filled circles show the array of inverse lattice 
points for the triangular lattice, while the broken lines show the Brillouin 
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Fig. 3. A scanning electron micrograph of a PCF with small air holes, as used to 
observe Bragg scattering patterns. The fibre shown has a pitch of 1.3 /rm 




a) b) 

Fig. 4. (a)shows a scattering pattern observed from a fibre such as that in figure 3 
excited at one of the Bragg conditions. The unscattered light is on the left at 0 = 0. 
(b) shows the Brillouin zone diagram for the triangular lattice. The Brillouin zone 
boundaries are indicated by dotted lines. A heavy arrow shows the incident fc-vector 
used to excite the Bragg condition in (a) 



zone edges. To predict the Bragg scattering points, one draws an arrow from 
the origin in the direction corresponding to the direction of incidence on the 
crystal. The arrow represents the incident in-plane wavevector, so the length 
of the arrow is simply If the tip of the arrow falls on a Brillouin zone 
boundary, Bragg scattering will occur. The directions of this scattering can 
be predicted from the directions of the Brillouin zone edges at this point. In 
the example shown (Fig. 4a) the light is incident along the F — X direction 
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and at an angle which corresponds to the arrow shown in Fig. 4b, giving rise 
to the attractive and highly symmetrical scattering pattern shown. 

A significantly more complicated situation can arise when the size of the 
air holes increases so that they become substantial. In this case each air hole 
can no longer be approximated as a Rayleigh scatterer, while the signifi- 
cant volume of scatterers in the medium results in a rapid attenuation of 
the field as the light moves through the crystal at some angles. Furthermore, 
there is substantial scattering at the interface between the index-matching 
oil (which has the index of pure silica) and the fibre (which has, effectively, 
a lower refractive index) .This last difficulty is compounded by the fact that 
the fibre has a hexagonal external surface (see Fig. 3), so that more than 
one interface is involved for any angle of incidence. These effects give rise 
to intricate scattering patterns with very different general features to those 
observed from small-air-hole samples. Specifically, strong scattering features 
are observed at all angles of incidence, in contrast to the sharp angular fea- 
tures observed from the small air-hole fibre. As in the small air-hole case, the 
patterns are dominated by a few strong and symmetrically-positioned peaks. 
Unlike the small air hole case, however, the patterns are heavily weighted in 
the backward direction, and we attribute this to the failure of the first Born 
approximation . 

To understand the observed scattering patterns we need to know some- 
thing about the possible modes within the photonic crystal material, and 
how these will be excited in our experiment. In any air-silica fibre, there is 
a maximum attainable value of /3 - the largest propagation constant possible 
parallel to the fibre axis. We refer to this as the propagation constant of the 
fundamental space-filling mode, Pfsm- The modal index f3fsm/k associated 
with this mode of propagation must be below that of pure silica, although 
it can approach that of silica for very small air holes and for short wave- 
lengths. Numerically, the propagation constant of this mode can be readily 
estimated [4]. As a result of this maximum /3- value the liquid/fibre interface 
can support total internal reflection for sufficiently small angles of incidence 
from the liquid side. At such small angles we expect that the fibre will then 
act as a perfect reflector. Knowing the value of Pfsm allows us to calculate 
the critical angle for the oil/fibre interface. The scattering pattern in Fig. 5 
was recorded for incidence along the T — J direction at just beyond the cri- 
tical angle, and shows a small amount of unscattered light (at 4> = 0°) and 
two bright spots at 120° from the forward direction. These two bright spots 
correspond to the simple total reflection from the effective refractive index 
interface at the fibre surface (see figure inset). These occur for a range of 
angles from the smallest experimentally attainable up to a maximum angle 
of about C = 8.5° where other features start to appear in the scattering dia- 
gram. The maximum angle at which the scattering patterns can be explained 
by simple reflections from the external surface is in excellent agreement with 
the critical angle calculated from the computed value of f3fsm- 
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Fig. 5. Scattering pattern observed from a large-air-hole fibre sample at a small 
angle ^ (large /3- value). The incident /? is greater than the f3fam for the structure, 
so that the scattering pattern is simply due to reflection from the external surfaces, 
as shown in the inset on the right 



At larger angles of incidence we expect to see complex scattering pat- 
terns due to imperfect mode-matching at the fibre surface, while in certain 
discrete angular ranges (when we have chosen a /3-value corresponding to a 
band gap of the microstructured material) we expect simple patterns very 
similar to that in Fig. 5. The observed scattering patterns do indeed show 
the appearance of more complicated features at angles slightly larger than 
that used in recording Fig. 5. These more complex scattering patterns result 
from partial mode coupling at the fibre surface under conditions where there 
are some allowed modes within the fibre. We have yet to see clear evidence 
in the scattering patterns of a complete photonic band gap, even in those 
samples where theory predicts that such a gap should occur. We believe that 
this is due to the departures from perfect periodicity in the fibre structure 
close to the external surfaces. We plan to concentrate our future studies on 
those fibres where we have direct experimental evidence for the existence of 
a band gap (see Sect. 6, and where electron microscopy shows that the fibre 
structure maintains its periodicity right to the external surface of the fibre. 



4 Design and Fabrication of Photonic Crystal Fibre 
Waveguides 

In order to use photonic crystal fibre as the cladding of an optical waveguide 
we need to use the periodic material under one of the two circumstances 
when we can expect to find total reflection from the structure, viz. incidence 
upon the structure with a /3- value above the largest possible /3 in the periodic 
material (/3 > Pfsm - see Sect. 5), or incidence with a /3- value corresponding to 
a band gap of the periodic material. The first condition can only be satisfied if 
the waveguiding core is formed by a ” high-index” defect within the structure 
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- a region containing a reduced amount of low-index material. This is most 
readily realised by replacing a single one of the capillaries in the stacking 
process with a solid cane of similar external diameter - we refer to this type of 
fibre as ’’index-guiding”, as the guidance mechanism is in many ways similar 
to that in conventional optical waveguides. On the other hand, ’’band gap” 
waveguiding is possible even when the crystal defect comprises an extra air 
hole (a ’’low-index defect”), and using just such a defect to demonstrate the 
effect has the advantage that the index-guiding mechanism can be excluded. 
In the next two sections we describe briefly the main observed features of 
these two types of guidance. 

5 Waveguiding by Total Internal Reflection from an 
Effective Index Structure 

A scanning electron micrograph showing an example of part of a cross-section 
through an index-guiding fibre structure is shown in Fig. 6. 




Fig. 6. Portion of the cleaved face of an index-guiding photonic crystal hbre. The 
site of the missing air hole constitutes a ’’high- index defect” in the photonic crystal 
lattice, causing the appearance of trapped modes localised within this region. The 
guided modes have propagation constants f3 greater than the largest value of pro- 
pagation constant found in the periodic region d/sm, and can thus be considered 
to arise as a result of total internal reflection from the silica/air structure. 



The fibre shown is typical of those which we have studied. The parameters 
of the fibre are A = 2.3 /tm and air hole size D = 400 nm. Our observations 
of the waveguiding properties of the fibre shown in Fig. 6 are that the fibre 
supports a guided mode which is confined to the vicinity of the ” high-index” 
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Fig. 7 . Modal intensity pattern recorded at a cleaved output face of an index- 
guiding photonic crystal fibre. The light intensity peaks in the centre of the ” lattice 
defect” site which has the missing air hole, and has deep minima at the positions 
of the nearest air holes. 



defect in the structure. The observed modal intensity pattern observed for a 
wavelength of A = 633 nm is shown as a contour plot in Fig. 7. The guided 
mode is seen to be tightly confined to the vicinity of the defect and to be 
structured so as to avoid the nearest air holes. The guided mode pattern is 
independent of the way in which the light is coupled in to the fibre, and of 
any bends and twists in the fibre length: the observed modal profile is that 
of the fundamental guided mode, and no other confined modes are excited. 
Very similar patterns to that shown in Fig. 7 are observed at all excitation 
wavelengths between A = 1.5 /rm and A = 337 nm, a variation in wavelength 
of a factor of more than 4. Likewise, by making fibres similar to that in 
Fig. 6 but with a different scale (up to a factor of 5 larger) we find that 
there is but a single robust guided mode. Such behaviour is quite different to 
that observed in conventional fibres, where increasing the core diameter or 
decreasing the wavelength by more than a factor of 2 will introduce higher- 
order guided modes. For very long wavelengths (A >> A) the behaviour of 
our fibre is expected to approach that of a conventional optical fibre with 
a core refractive index equal to that of silica and a cladding index equal to 
the root-mean-square volume-weighted index of the silica/air structure. For 
shorter wavelengths, the morphological microstructure of the fibre cladding 
results in a microstructuring of the optical fields within the cladding. As a 
result, the apparent optical properties of the cladding change as a function of 
wavelength (or, more correctly, as a function of A/X). This variation in the 
’’effective refractive index” [4] is closely related to our observation that the 
fibre can be ’’endlessly single-mode”, not supporting a second bound mode 
for any wavelength no matter how short. Likewise, other observed properties 
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of the fibre are different to those familiar from conventional waveguides. For 
example, the bend loss characteristics and the dispersion [17] of the fibre are 
quite different from those in normal optical fibres, especially in the regime 
where a conventional fibre would be overmoded. 

6 Band Gap Waveguiding 

Although it is possible (at least in principle) to form an optical waveguide 
by using the band gap formed by a triangular array of air holes as a cladding 
material, we have not yet observed such guidance. This can be attributed in 
part to the difficulty of fabricating samples with a sufficiently high degree of 
uniformity which have a small enough pitch A and large enough air holes D. 
However, we have observed a closely related type of waveguiding, due to a 
photonic band gap in a honeycomb lattice. The requirements for uniformity 
are relaxed in this case because the honeycomb structure has intrinsically 
broader band gaps [18]. This is especially true for substantially out-of-plane 
propagation [19]. An electron micrograph showing such a fibre is presented 
in Fig. 8. This fibre was fabricated by drawing down a stack formed from 







Fig. 8. Scanning electron micrograph showing a honeycomb ’’band gap” fibre. In 
this structure, light becomes trapped in the vicinity of the extra air hole in the 
centre of the fibre (a ’’low-index defect”). The confinement in this case is due to 
the band gap of the surrounding band-gap material, not total internal reflection. 



solid silica canes as well as hollow capillaries. An extra air hole in the middle 
of the stack forms a region with an increased air fraction - a ’’low-index” 
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defect in the honeycomb lattice. Instead of a honeycomb lattice of air holes 
the fibre shown in 8 can be usefully considered to be a triangular array of 
almost close-packed silica rods. Each of these rods would, if isolated, support 
waveguide modes with distinct values of propagation constant. When a large 
number of these are coupled together, each mode splits into a large number 
of closely-spaced modes, leading to transmission bands through the material. 
However, the composite material can also have regions of (3 where no mo- 
des exist. For our samples, numerical modelling [19] predicts the appearance 
of such band gaps at visible frequencies. By introducing a low-index defect 
into the structure, we can create localised modes - confined modes which 
are trapped to the vicinity of the defect by the band gap of the surrounding 
material. It should be noted that the band gaps which we have been study- 
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Fig. 9. Near-field intensity map observed from a defect-guided mode in a honey- 
comb band-gap fibre, such as that shown in figure 8. The mode has been excited 
using a laser source (A = 528 nm). The six symmetric lobes appear around the 
central ’’defect” hole in the honeycomb structure(see Fig. 8. The corresponding far- 
field pattern (not shown) has a zero on the axis, indicating the presence of phase 
reversals in the guided mode profile. 



ing in the honeycomb structure as shown in Fig. 8 differ from those formed 
in a triangular lattice in an important respect: band gaps in the triangular 
structure arises from a strengthening of Bragg scattering [7], and as such can 
potentially appear over a very broad range of f3. The ’’coupled resonator” 
model [7] outlined above for the honeycomb band gap suggests that the gaps 
in this structure will appear for relatively large values of (3. Indeed, for such 
large values of f3 propagation is in general forbidden within the material, 
light being evanescent in the air gaps. Propagating modes in this regime only 
arise at specific /3-values corresponding to resonances of the individual silica 
strands, which when coupled together form ’’band windows” within the broad 
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forbidden region. Thus, band gaps are likely to be broader and intrinsically 
more stable than those formed in a triangular structure for smaller /3-values. 
While there is no evidence that low-/3 Bragg-type band gaps cannot appear 
in a honeycomb structure, there is at present no reason to believe that these 
could be more easily observed than in a triangular lattice. Bragg-type band 
gaps in a photonic crystal fibre are an especially attractive goal because it is 
possible (at least in principle) to use such band gaps to form waveguides in 
which the guided mode is trapped within an air hole. Naturally, to form this 
mode would require a band gap with a relatively low value of /3. Such a wave- 
guide would have obvious applications. Nonetheless, the ’’coupled resonator” 
band gaps can be used to form confined modes within a honeycomb structure, 
and these can be observed at low-index defect sites such as that in Fig. 8. 
An example of such a guided mode is shown in Fig. 9. The guided mode field 
profile has variations on the scale of approximately 1 /xm, far sharper than 
those observed in the high-index defect guiding fibre. This indicates a faster 
phase variation in the periodic plane than in the index-guiding case, which is 
directly related to the smaller value of out-of-plane propagation constant /3 
in the present mode. The modal field pattern looks distinctly like a higher- 
order mode of the fibre core, and indeed one can think of it as such. Lower 
modes (with larger values of ( 3 ) will exist, but they will not be confined to 
the vicinity of the core by the cladding material as they fall outside the band 
gap. The guided mode shown in Fig. 9 is confined to the defect only for a 
relatively narrow range of frequencies - it is confined for wavelengths between 
458 nm (blue) and 528 nm (green) but is not confined at 633 nm (red). This 
strong spectral dependence means that very beautiful and brightly coloured 
modal field patterns can be observed in these structures using a white light 
source [6]. 



7 Conclusions 

We have adapted the technology of optical fibre fabrication to produce two- 
dimensionally periodic silica/air photonic crystal materials. These materials 
consist of fine fibres with a regular and 2-dimensionally periodic array of air 
holes running down their length. These air holes can have diameters of less 
than 100 nm and be spaced by distances of the order of 1 /xm or greater. The 
optical properties of these microstructured materials are quite different from 
those of silica, and can usefully be studied by scattering experiments. These 
new properties have enabled us to observe waveguiding within the photonic 
crystal fibres by introducing purposeful defects into the photonic crystal lat- 
tice. These photonic crystal fibre waveguides have some startling properties, 
and there is a great depth of work still to be done both in understanding 
those effects which have been observed, and in extending the limits of what 
is technologically achievable so as to observe new effects. 
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Abstract. Sonoluminescent bubbles constitute a new kind of microparticles, which 
cavitate (expand and collapse) due to sound action, with remarkable properties of 
light radiation. The observation of sonoluminescence in bubbles has opened an ac- 
tive research on a phenomenon of light radiation by sound conversion. Experiments 
have been performed on emission spectra, estimation of the bubble size by light 
scattering, assessment of the shape and its possible departure from dipole radia- 
tion angular correlations, and concerning the reactive nature of the gas inside the 
bubble. Several theories have been put forward, like e.g.: those that support the 
sphericity of the bubble; those that establish that it is actually non spherical, the 
emission being due to a jet implosion in which one half of the bubble collapses into 
the other half ; models to explain the dipole strength of a surrounding stream as 
experimentally observed ; and suggestions that the emission should rather originate 
in the liquid enclosing the bubble. However, the key problem of determining the 
size and shape of the bubble at the interval (50 ps) of emission has only recently 
been solved. Angular correlations from intensity measurements of the detected so- 
noluminescent flashes, diffracted by the bubble surface at the emission instant have 
been reported. We review here the theory on the angular distribution of the light 
emitted from several bubble shapes and sizes. This allows to establish which one 
matches with the experiments. For instance, it is found that an ellipsoidal shape of 
1.5 to 2 /rm and eccentricity 0.2 exhibits an excellent agreement with the existing 
data at all ranges of wavelengths. 



1 Tutorial Background 

1.1 Current Problems in Sonoluminescence 

A bubble, produced in liquid, may radiate light under the action of a sound 
wave. The acoustic frequency usually is in the range 26 — 60 kHz and the 
intensity about 110 decibels. This conversion of sound into light is known 
as sonoluminescence. The phenomenon was first discovered in the 1930’s as 
the shining from a water filled vessel with many air bubbles inside. However, 
recently, this effect has experienced a renewed and more detailed research 
[1-3], and now more controlled experiments have been conducted in which 
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Fig. 1. Sketch of an experimental arrangement for light scattering measurements 
by a single sonoluminescent bubble. Basically, two types of measurements can be 
put forward: (i) The light elastically scattered by the bubble, illuminated by an 
external source (HeNe laser), is collected by the detector D1 as a function of time; 
(ii) The light flash emitted by the bubble (i.e., no external illumination is used) 
is collected by two detectors D1 and D2. This two signals can be subsequently 
correlated as a function of the angle between D1 and D2. 



light is produced by a single, very bright air bubble, held in water under the 
action of ultrasound at intensity about 110 decibels (see Fig. 1). Experiments 
in heavy water with the bubble filled by a hydrogen isotope (either H2 and 
D2) have also been performed Another interesting experimental fact is the 
sonoluminescence flash intensity dependence on the reactivity nature of the 
gases contained inside the bubble [5,6]. In particular, it has been shown that 
increasing the noble gas content (Xe, Ar, He) of a bubble stabilizes the bubble 
motion and increases its light emission intensity by an order of magnitude 
with respect the intensity emitted by the non-doped bubble. 

As the pressure in the acoustic wave drops, the bubble, initially with a 
radius of a few microns, starts to inflate up to a radius of about 50 /rm within 
a time interval of approximately 20 microseconds. Then, as the wave acoustic 

^ An extensive overview on the different aspects and new advances in single bub- 
ble sonoluminescence is out of the scope of this work. The interested reader 
can consult the WEB-page maintained by Dr. A. Levinson at the internet ad- 
dress http://ne43.ne.uiuc.edu/ans/sonolum.html. This WEB-page contains use- 
ful and interesting links to introductory-level papers on sonoluminescence. A 
more specialized-level review on the experimental state of the art up to date can 
be found in [4]. 
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pressure starts to increase, the bubble drastically shrinks to 0.5 /tm in about 
5 microseconds. Subsequently, its radius oscillates up and down by a few 
microns until it reaches the initial value. A new sound wave period hitting 
the bubble triggers the same process once again. Light emission occurs as the 
bubble compresses. 

These variations of the bubble size have been analyzed by means of light 
scattering experiments [7]. In these experiments, laser light is scattered by 
the bubble surface and its angular distribution is detected (see Fig. 1). Large 
scattering angles (about 60°) are usually addressed. The intensity lobes are 
smoothed out. Then, a geometrical optics limit, according to which the mea- 
sured intensity is proportional to the the sphere surface, is employed to fit 
the bubble radius to the measured light intensity. However since the WKB 
approximation cannot be very accurate in this regime of bubble sizes, it has 
been recognized in these works that the uncertainty of radius determination 
by this method can be up to 20 percent for sizes of about l/rm. Thus, other 
methods for determining the size, as well as the shape of the bubble, have 
been sought. The last advancement in this respect has been the determina- 
tion of the angular correlations of the intensity of the emitted light, which 
we shall address in the next section. 

The time of light emission during the sonoluminiscent process is over 
50 ps [8-10]. In this interval, assuming a speed of the bubble boundary of 
1000 m/s, its radius changes by 50 nm, and the light is emitted from a 
small region within the pulsating bubble; then this light scatters from the 
bubble surface, subsequently being detected. Due to this small variation of 
the radius compared with its size, about 1 /rm, the angular distribution of 
recorded intensity yields information of a frozen size and geometry [10,11] 
during the emission time. 

The underlying cause of emission is however not clear at all yet. There are 
models that claim the existence of a shock wave due to the rapid movement 
of the bubble on compression, this giving rise to temperatures and pressures 
millions of times the normal ones, thus the air inside the cavity becoming 
plasma [7,12]. If this were true, this would suggest the existence of a fusion 
process. However, no released He atoms nor neutrons have been detected so 
far. Another theory proposes the existence of liquid jets due to the bubble 
oscillations [13-15]. This would induce electric currents which would radiate 
light. Also, there exist theoretical models that predict light emission from 
the liquid enclosing the bubble, due to the electric currents generate at the 
water breakdown [16]. Even the Casimir effect has been proposed as a possible 
mechanism for this phenomenon [17-19]. 

The flash spectrum is very similar to a blackbody one, and, with tempe- 
ratures inside the bubble of the order of 10® K, (and pressures reaching 10^ 
times the atmospheric one), this spectrum peaks within the 200 nm - 300 
nm interval, thus likely being in the ultraviolet region. The peak power is 
about 30 mW. Since however, light at this frequency does not propagate in 
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water, this peak is very difficult to measure. As a single flash of the bubble 
does not contain enough light for data analysis, the authors of [11] measured 
angular correlations from many sonolominiscent shots at either the whole 
emission spectrum, or at red, (A > 500 nm), or blue, (260 nm < A < 380 nm) 
wavelengths. In this review it is seen that those experimental data provide 
a great deal of information, as a matter of fact the only one so far, in order 
to determine the size and shape of the pulsating bubble during the emission 
process. 



1.2 Integral Equations for Light Scattering from Arbitrary 
Bodies 

Let us consider an electromagnetic field, with electric and magnetic vectors 
E(*)(r) and H*^®^(r), respectively, incident on a medium of permittivity e 
occupying a volume V, constituted by multiple scattering volumes, namely, 
V is composed of parts Vi, V 2 ,..., Vat, each of which is limited by a surface Si, 
S' 2 ,..., Sn, respectively, (see Fig. 2). We shall denote by r^ the position vector 
of a generic point inside the volume Vi, and by r^ that of a generic point 
in the volume V, which is outside all volumes Vi. The electric and magnetic 
vectors of a monochromatic field, satisfy, respectively, the wave equations 
[ 20 ]: 



V X V X E(r) - fc2E(r) = Fe(r), (1) 

Vx VxH(r)-fc2H(r)=F,„(r), (2) 

where k is the wavenumber, and Eg and are the source terms that cha- 
racterize the generation of electromagnetic waves. These terms are zero in 
the volume V outside the medium; and inside are: 

Fg(r) = 47t[— j(r) -|- /c^P(r) + ikV x M(r)j, (3) 

Fm(r) = 47 t[-V X j(r) — ikV x P(r) -|- fc^M(r)], (4) 

In (3) and (4) P and M are the polarization and magnetization vectors, 
respectively; and j represents the electric current density. 

Let Q{r,r') represent the dyadic Green function: 

0(r,r') = (I+pVV)G(r,r'). (5) 

G(r,r') being the outgoing spherical wave: exp(tfc|r — r'|)/|r — r'|. Q{r,r') 
satisfies the equation: 



V X V X 0{r, r') — A:^t/(r, r') = At:5{v — v')I. 



( 6 ) 
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Fig. 2. Illustration for the scattering geometry. 



For P and Q well behaved in a volume V surrounded by a surface S the 
vector form of Green’s theorem reads [21]: 



IV 



(Q-VxVxP — P-VxVx Q)dv = 
[ (PxVxQ — QxVxP)- nds, 



( 7 ) 



n being the unit outward normal. 

Let us now apply (7) to the vectors: P = C/(r,r')C, (C being a constant 
vector), and Q = E(r). (The singularity of C/(r,r') at the origin is integrable 
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for s-waves, whereas it is not for p-waves, (cf. e.g. [20] and [22]). Taking (1) 
and (6) into account, we obtain: 



J^E{r')S{r - r')dv = ^ J^Fe{r') ■ G{r,r')dv - ^Se(r). (8) 

where Sg is: 



Se(r) = V X V X 




aC(r,r') 

dn 



G(r,r') 



aE(r') 

dn 



)ds. 



(9) 



Equation (8) adopts different forms depending on where the points r and 
r' are considered. The wavenumber k entering in (8) is: k = k^y/e or k = ko 
according to whether r' is in V or in V, respectively, ko = 27t/A. By means 
of straightforward calculations one obtains: 

a) If r and r' belong to any of the volumes Vi, {i = 1,2, N), namely, V 
becomes either of the volumes Vi'. 



E(r<) 



1 

47T 



Fg(r') • g{r<,r')dv 



1 (j(m) 

47T®* 



(r<). 



(10) 



where: 



g(“)(r<) = V X V X 




9G(r<, r') 

dn 



G{r<,v')^^^)ds. ( 11 ) 



In (11) Eiin represents the limiting value of the electric vector on the surface 
Si taken from inside the volume Vi. Equation (10) shows that the field inside 
each of the scattering volumes Vi does not depend on the sources generated 
in the other volumes. 

b) If r belongs to any of the volumes Vi, namely, V becomes Vi, and r' 
belong to V : 



0 = Sg,gt(r<). 



In (12) Sext is: 



S.«(X) = y:s<">(r<)-S„(r<). 

i 



( 12 ) 



(13) 



where: 



{out) 



(r<) = VxVx [ (E(r')^^^^J^^ -G(r" 
Jsi 






(14) 



In (14) the surface values of the electric vector are taken from the volume V 
outside the volumes Vi. The normal n now points towards inside each of the 
volumes V). 
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Also, Soo has the same meaning as (14), but with the surface of integration 
now being a large sphere whose radius will eventually tend to infinity. It is 
not difficult to see that —Soo in (13) is equal to 47 t times the incident field 
E(d(r<) (cf. [20] and [23]). Therefore (13) becomes finally: 

0 = EW(r<) + ^^s|°“‘)(r<). (15) 

i 

Equation (15) is the extinction theorem for the electromagnetic field in a 
system composed of an arbitrary number of scattering bodies, each of which 
is limited by the surface Si. Note that when this equation is used as a non- 
local boundary condition, the unknown sources to be determined, given by 
the limiting values of E(r') and 9E(r')/(9n on each of the surfaces Si, (cf. 
Eq.(14)), appear coupled to those corresponding sources on all the other 
surfaces Sj, j ^ i. 

Following similar arguments, one obtains: 

c) For r belonging to V and r' belonging to any of the volumes Vi, namely, 
V becoming V: 

0 = Fe(r') • g(r>,r')dv- ^sf”^(r>), (16) 



with given by Eq.(ll), this time evaluated in r^. 
d) For both r and r' belonging to V: 

E(r>) = EW(r>) + ^ E sE‘^(r>), (17) 

i 

Hence, the exterior field is the sum of the fields emitted from each scatte- 
ring surface Si with sources resulting from the coupling involved in the FT 
(15). The other important case corresponds to a penetrable, optically homo- 
geneous, isotropic, non-magnetic and spatially nondispersive medium, (this 
applies to a real metal or a pure dielectric). In this case, (10) and (15) become, 
respectively: 



E(r<) = - 



47rfcge 



V X V X 
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whereas (16) and (17) yield: 



0 = 






V X V X 






( 20 ) 



E(r>) = E^(r>) + x V x 



( 21 ) 



i9n 



In (18) and (20) ”zn” means that the limiting values are taken on the 
surface from inside the volume Vi; note that this implies for both and 

Ein that k = k^y/e. 

The saltus conditions: 



nx [E„(r<)-E(r>)] = 0, n x [H,„(r<) - H(r>)] = 0 (22) 

across the surface St permit to find both E and dE/dn from either the pair 
(20) and (21), or, equivalently, from the pair Eqs.(18) and (19), as both r> 
and tend to a point in Si. Then the scattered field outside the medium is 
given by the second term of (21). 

In the next section, we apply this theory to the light created at the center 
of a sonoluminescent bubble and scattered by its surface. In this case, the 
incident field E^'^ is a spherical wave which hits the bubble surface from 
inside. The non-local boundary condition to use are (20) and (21), slightly 
modified to account for the fact that now the scattering volumes Vi reduce 
just to one V: the bubble, whose surface we shall denote as S. It is within 
this volume V where the incident field exists. Therefore, Eqs. (20) and (21) 
become, respectively: 



0 = E«(r<) -h 



dTrfcg 



V X V 






(23) 



E(r>) = - 



dTr/cgC 



V X V X 



( 2 , 
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2 Distribution of Radiated Light and Angular 
Correlations 

Figures 3(a)-3(d) depict the shape and size of four of the several bubble 
geometries that we have tried for the analysis, and that are in principle com- 
patible with arguments previously reported. These four sections correspond 
to the cut of the bubble by the plane at which the angular distribution of 
emitted light is being calculated as a function of 9 (i.e., the plane defined by 
the position of the two detectors and the center of the bubble) . The light is 
radiated by the point source marked by a dot at the center of each of these 
air bubbles. On scattering of this light wave by the bubble surface, one has 
a scattered field in the sorrounding water which is expressed as [20] : 



^(s,p) (^) = I / 



Id 



dtp} , (r') 



v^(n'-ke)v3p,,p) (r') - * 



dn' J 
exp [i-/ekg.r') . 



(25) 



The integral in (25) is extended to the bubble surface D. e is the permit- 
tivity of water, kg is the wave vector along the observation direction defined 
by the angle 9, . represents the boundary value at the bubble surface D 

of the light wavefield in the surrounding water at the A-wavelength either for 
s (TE waves) or p (TM waves) polarization, n' is the local outward surface 
normal. 

The boundary values p and dp/dn! are found from the non-local bound- 
ary conditions (we omit the subindices and superindices) [20] : 



p{v) 




dG (r,r') 
dn' 



p{r') -G(r,r') 



dp (r') 
dn' 



(26) 



1 

47T 




o='JsGllG,{.y)- 






dn' 



dn' 



(27) 



G(r,r') is the spherical wave Green’s function: exp{ik^JeR) / R, i? = |r — 
r'l, k = 2tt/X, €b is the permittivity of the air within the bubble, c is the 
speed of light in vacuum, Gb = exp{iky/eBR) / R, i? = |r — r'|, pB denotes 
the limiting value at the bubble surface of the light field inside the bubble, p, 
Pb, dpjdn' and dpBjdn' are determined from (26) and (27) and the saltus 
conditions: 



V = ‘Pb, 



(28) 
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a ^dip/dTO.' = ag^dipBldx)! , (29) 

where a and ub are both either equal to 1 for s-polarization, or to e and es 
for p-polariation, respectively. 

The emitted intensity in the water in an interval of wavelengths [Ai, A^v] 
then is: 



A — Ajv 

I{9)= Y. Pxh{9) = YPx\\^U0)\ 






A=Ai 



(30) 



where we have added the intensities over all wavelengths in the above mentio- 
ned interval with relative weights P\, given by the sonoluminiscent spectral 
distribution , P\ reported in [12]. Also, we have considered unpolarized light 
given by the incoherent superposition of the two states of polarization s and 
p in the plane of measurement. 

It is worth to note that the incoherent superposition of scattered intensi- 
ties given by (30) implies the creation of a total effective intensity I (9), which 
is partially coherent [24]. Furthermore, the degree of coherence of I{9) is in- 
versely proportional to the sonoluminescent spectral distribution P\. Namely, 
the degree of coherence of I {9) is narrower the broader P\ is 

The angular distribution of scattered intensity corresponding to each of 
the four shapes is shown in Figs. 3(a)-3(d) at the four different wavelengths 
indicated there. The dielectric permittivities e and cb are 1.77 and 1, respec- 
tively. Even for the elipsoidal bubble, notwithstanding its small eccentricity, 
an interference pattern, on account of the scattering from the bubble surface, 
appears. Note, however, that the contrast of these intensities increases with 
the irregularity of the surface. 

The angular correlation function of intensities is calculated as: 



C{A9) 



{{I {9) - {I (g))} {/ {9 + A9) -{I {9 + A9))}) 
{I{9)){I{9 + A9)) 



(31) 



where (.) denotes ensemble average over the different realizations correspon- 
ding to the random orientations of the bubble at different flashes, as reported 
by the experiments. On using the ergodic property of the random intensities, 
this average can actually be performed over the 0-argument as: (.) = J^^{.)d9. 

^ This issue has important consequences. For instance, in several experimental 
situations, the width of the sonoluminescent temporal pulse is much larger than 
the fluctuation time of the bubble dynamics (i.e., large bubble gas concentrations 
and driving pressures). In this case, our theory inmediately indicates that this has 
a net effect of disminishing the temporal coherence length of the detected light. 
In consequence, much weaker, or even negligible, angular correlations would be 
obtained. Hence, the track about the bubble morphology would be lost even in 
the case in which the bubble collapse is asymmetric. 
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Fig. 3. Angular distribution of scattered intensity by an air bubble (es = 1) inmer- 
sed in water (e = 1.77). The light source is assumed to be point-like at the bubble 
origin (marked in the insets as a point). Four different wavelengths are considered: 
A = 250 nm (solid line), A = 450 nm (long-dashed line), A = 500 nm (short-dashed 
line), and A = 1000 nm (short-dashed line with circles). 3(a): Ellipsoidal geometry. 
Size parameters: a = 1 /rm, b = 0.75 ^m. 3(b): ’Single-jet’ geometry. Size para- 
meters: i?i = 1 /im, i ?2 = 0.5 ^m. 3(c): ’Double-jet’ geometry. Size parameters: 
i?i = 1 /rm, R 2 = 0.25 /rm. 3(d): ’Fourfold’ geometry. Size parameters: Ri = 1 /rm, 
R 2 ~ 0.67 fim. The periodicity in the measured angular correlations [11] excludes 
the geometries of Figs. 3(b) and 3(d). 



It is worth noticing that I (9) and C{9) have both the same angular periodicity, 
a fact that will be used below to select the bubble geometry. 

From the intensity distributions of Figs. 3(a)-3(d) it is seen that only the 
intensity correlation functions of geometries with one axis of symmetry have 
a variation period between 0° and 180°, as established by the experiment 
of [11]. This pertains to the shapes of Figs. 3(a) and 3(c). The correlation 
function for the geometry of Fig. 3(b) has a period between 0° and 360°, and 
that for the shape of Fig. 3(d) has a fourthfold symmetry, thus its period is 
between 0° and 90°. Therefore, this fact rules out the validity of these last 
two geometries. 

On the other hand, concerning the geometries of Figs. 3(a) and 3(c), (el- 
lipse and oval, respectively), we present the angular correlation of the inco- 
herent superposition of scattered intensities, which are distributed according 
to the calibrated spectral density of sonoluminiscence reported in Ref. [11], 
for light scattered from the ellipsoid (Fig. 4(a)) and from the oval (Fig. 4(b)). 

In order to get a realistic comparison with the experimental data, noise has 
been added to each of those signal intensities, such that < I\ >^= 10“^, 
(7^ being the noise variance and < I\ > denoting the mean value of the 
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0 50 100 150 



A0 



Fig. 4. Angular correlation of light intensity scattered by the babble snrface. The 
scattered intensity spectral distribution at fixed observation angle 9 has been nor- 
malized to the full sonoluminescence flash spectrum (data taken from [12], Fig.l). 
4(a): Ellipsoidal geometry. Size parameters: a = 1.5 /im, b = 1.25 fim. 4(b): Double 
jet geometry. Size parameters: Ri = 1 /rm, R 2 = 0.25 /rm. The experimental data 
in Fig. 4(a) (circles) have been taken from [11], Fig. 1. The magnitude of the 
experimental angular correlation ]11] rules out the geometry of Fig. 4(b). 



scattered intensity at the A-wavelength. Taking into account this fact, the 
new correlation function C'{A9) becomes now 



C'{Ae) = C{A9)+a^/{hf, (32) 

where C{A9) is given in Eq. (31). 

The addition of the noise term a'^/{I{9))‘^ has important consequences: 
the interval of values of C is not centered at C = 0, but C appears shifted 
towards positive values, also observed in the experimental values [11] . Figures 
4(a), 5(a) and 5(b) reflect this fact. 
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A0 

Fig. 5. Angular correlation of light intensity scattered by an ellipsoidal bubble 
with eccentricity e = ajh — 1 = 0.2. Three different sizes are considered: a = 1/rm 
(solid line), a = 1.5 fim (long-dashed line), and a — 2 /rm (short-dashed line). The 
scattered intensity spectral distribution, taken from Fig.l of [12], has been splitted 
both into the ’blue’ sonoluminescence flash spectrum (Fig. 5(a), 280 nm < A < 320 
nm), and the ’red’ sonoluminescence flash spectrum (Fig. 5(b), A > 500 nm). The 
experimental data (circles and triangles) are those from [11]. 
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As seen, only the the angular correlation of light scattered from the ellip- 
soid varies in the range between —2 and 4 parts per thousand as reported in 
the experiments of [11]. The one corresponding to light emanating from the 
oval takes on a much a broader range of values, as it corresponds to scattered 
intensities with larger contrast than those from the ellipsoid (cf. Figs. 3(a) 
and 3(c)). Hence, we conclude that the oval shape should be also excluded, 
(incidentally, observe that the even larger contrast in Figs. 3(b) and 3(d) also 
rules out the geometries corresponding to these two figures). 

Once we have concluded that only the ellipsoid geometry is compatible 
with the experiments, we proceed with the search for its size. Figures 5(a) 
and 5(b) show the angular correlation of scattered light for the incoherent 
superposition of intensities, following the sonoluminiscent spectral distribu- 
tion as before. However, now the addition of intensities is performed at two 
diferent intervals of wavelengths, namely, for wavelengths varying either lar- 
ger than 500 nm for red light, or between 200 nm and 300 nm for blue light. 
The bubble is an ellipsoid with a being either a = 1.0 /im, 1.5 /xm or 2 /xm; 
the eccentricity e = ajb — 1 being e = 0.2 in all cases. A one per thousand 
noise have been added to the scattered intensity data as before. From these 
two figures we observe an excellent agreement with the experimental data, 
within the estimated error in the measurements, although Fig. 5(a) indicates 
that the elliptic bubble with a = 2 /xm matches the best for the blue light. 



3 Conclusions 



In this review we have given the theory of light scattering from the surface 
of a sonoluminescent bubble. This study permits to determine the shape and 
size of a sonoluminescent bubble on account of light correlation measurements 
at the emission instant. The conclusion is that the bubble surface is actually 
an ellipsoid of eccentricity very close to 0.2 and major radius between 1.5 /xm 
and 2 /xm. Calculations show that one tenth eccentricity deviations from 0.2 
change by one order of magnitude the amplitude of the correlation function. 

It should be remarked that this theory may be equally adapted to any 
experimental data that might appear in future. The conclusions on the bub- 
ble morphology would vary accordingly. For instance, if future experimental 
data show that the temporal width of the emitted sonoluminescent fiash is 
much larger than the fluctuation time of the bubble hidrodynamic, our theory 
indicates that this fact has a net effect of shortening the temporal coherence 
length of the scattered light. Hence, much weaker, or even negligible, angular 
correlations would be obtained whatever the bubble morphology is. 
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Abstract. Light scattering intensity patterns produced by particles on substrates 
depend strongly on the size and shape of the particles. For some regular particle 
shapes, like spheres and cylinders, minima exist in the intensities and a relationship 
exists between their angular positions and the particle size. This dependence may 
be established using a simple double-interaction model which considers only the 
following: a) the scattering solution for the isolated particle; b) the coherent addition 
of four contributions of the scattering amplitudes to account for the held rehections 
off the substrate; and c) some geometrical factors which include the path differences, 
the Fresnel rehection coefficients, and the geometrical shadowing. This model has 
been used to size particles when illuminated at both normal and oblique incidence. 
For a collection of polydisperse particles on a substrate, scattering patterns can also 
produce very useful information; for instance, the visibility of the minima is directly 
related to the variance in size for low polydispersity samples. For large values of 
polydispersity (more than 30%), the width of the enhanced-backscattering peak 
can provide useful information. 



1 Introduction 

Scattering by microstructures on flat substrates constitutes an important field 
of interest, since it includes particles as contaminants on clean manufactured 
surfaces [1]. Much effort has been devoted to modeling the scattered light 
from these surfaces, including the analysis of different effects and the test of 
approximate numerical solutions [2,3]. Among these effects are multiple scat- 
tering and cross-polarization [4], evolution of backscattering [5], near- field 
effects [6], enhancement of the surface held [7], and surface plasmon genera- 
tion [8,9]. The development of models and approximate solutions contributes 
to our knowledge of the connection between the characteristics of the particle 
as a diffuser and the scattering patterns obtained from it. For instance, light 
scattering from particles on surfaces may sometimes show enough specific in- 
formation to allow a direct estimate of the particle size, therefore constituting 
a simple and non-invasive particle sizing method [10,11]. 

Several features exist in the scattering patterns that may help in the 
inverse problem. These depend on the particular sample of interest and on 
the experimental feasibility of the observable feature. An extensive use of 
theoretical models can assist in the analysis. The simplest method of analysis 
is the angular positions of the minima of certain scattering patterns, and 
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this method is our choice for most of the cases presented in this chapter. 
A great variety of models have been developed to assist us in our study of 
particulate surface. To be useful, the model must be physically meaningful, 
accurate, reliable, adaptable to many experimental conditions, and fast. Our 
choice for satisfying these criteria is the double-interaction model which we 
develop in Sect. 2. Section 3 is devoted to the inverse problem as an important 
application of the model. First, the size dependence of the minima in the 
scattering patterns is discussed, and expressions are obtained to find the 
particle size from the S'-copolarized scattering patterns. The effect of size 
distributions on the scattering patterns is discussed later. 

2 Theory 

2.1 Double-Interaction Model and Shadowing Effects 

A simplified method of the scattering by an object (in our case a sphere or 
a cylinder) on a flat substrate may be constructed by means of the double- 
interaction model [12]. The scattering problem is divided into several isolated- 
particle scattering problems, as pictured in Fig. 1, whose solution can be 
calculated. 






Fig. 1. Geometry of the double-interaction model. The superposition of these four 
components make up the total scattered electromagnetic held. 



The particle of interest is centered a distance h above a flat substrate of 
complex refractive index TOgub and illuminated by a plane wave whose incident 
angle with respect to the surface normal is 9i, taken always as positive, and 
wavelength A. The spherical or cylindrical particles we consider are further 
characterized by their radius R and complex refractive index rupar- From Fig. 
1, we see that the particle is actually illuminated by two different plane waves, 
directly and after a Fresnel reflection from the substrate. The particle then 
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scatters this incident light as if it were isolated. The light is scattered to a 
detector placed in the plane of incidence at observation angle Og. From Fig. 1, 
we see that the scattered light reaches the detector in two ways, one directly 
and the other after a Fresnel reflection from the substrate. There are four 
components to the total scattered held which we number as follows: 1) direct 
incident, direct scattered; 2) direct incident, reflected scattered; 3) reflected 
incident, direct scattered; and 4) reflected incident, reflected scattered. Note 
that this model does not include the multiple interactions that occur between 
the particle and the substrate. 

Each of the four rays shown in Fig. 1 has traveled a different pathlength 
to reach the detector. Each pathlength results in one or more phase shifts 
S{9) which depend on the incident angle or scattered angle {9 = 9i or 9g ) , 
separation distance h, and wavelength. A: 



m = 



Airh cos 9 
A 



( 1 ) 



The amplitudes and phases are also modified by the Fresnel reflections 
occuring when the light rays reflect off the substrate: 



s{0) = 



cos 9 - \lrnl^^ - sin 



COS0 + - sin^ ( 



rp{9) = - 



"iLbCOs6>- 






( 2 ) 



( 3 ) 



The four components of the total held may be written as follows: 



El = Ao A{t: -9,- 9s) (4) 

E 2 = Aq A{9i - 9s)u{9s) exp [ti5(0s)] (5) 

E3 = ^0 A{9i - exp [i5{9i)\ (6) 

E4 = Ao A{tt -9i- 0s)f,(6»i) f*(6>s) exp [i6{9t) + ii5(0s)] (7) 



where r* is the appropriate Fresnel reflection coefficient Fg or Fp depending on 
the polarization state of the incident or scattered ray; Aq is the amplitude of 
the incident held; and A{9) is the complex scattered held given by the isolated 
particle. For a spherical particle, A{9) is the complex electric scattered held 
for an isolated sphere (Mie theory) at a scattering angle 9. The total scattered 
held is the superposition of these four components given by 

4 

Et = ^ Ej 
i=i 



( 8 ) 
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Equations (4-8) represent the double-interaction model in its purest form. 
Model results are fairly accurate and the equations can be evaluated almost 
as rapidly as the scattering results from the isolated particle. The only thing 
we have not considered in this model is the interaction which occurs between 
the particle and the substrate, i.e., the scattered waves from the particle that 
reflect of the substrate and are incident upon the particle. In the case of metal- 
lic particles, we can, however, make a first-order approximation of this effect 
by including the shadowing factors. Consider the third component when 
the incident angle approaches normal incidence — >■ 0°. This component is 

obscured by the particle; i.e., the particle casts a shadow, hence a “shadowing 
effect” occurs for this component. Similarly, for the second component, the 
sphere obscures some of the light that reflects off the substrate when 9s ap- 
proaches zero. Both obscurations are present in the fourth contribution. To 
account for these effects, a shadowing factor Fs{9) can be introduced on these 
components in the model. It is defined as the ratio between the shadowing 
cross-section (the obscured part) in the direction 0, where 9 is either 9i or 9s 
and the total cross-section of the particle. For a sphere at rest on a substrate, 

7T 

and for a cylinder 



Fs{9) = l-sm\9\ (10) 

Including the shadowing factors, (4-7) become 

Ei= AoA{tt-9,-9s) ( 11 ) 

E 2 = Ao A{9i - 9s)r^{9s) exp [t<5(6»s)] [1 - Fs{9s)]^ (12) 

E 3 = Ao A{9i - 9s)n(9t) exp [z5(6>i)] [1 - Fs(9i)]^ (13) 

F 4 = Ao A{-k - 9i- 9s)r^{9i) r^{9s) exp [zi5(6»s) -I- i(5(6>s)] (14) 

x[l-E,(0,)]' [1-E,(0,)]5 



It is interesting to emphasize the distinction between the radius of the 
particle and the distance from its center to the substrate surface. For the 
particle at rest on the substrate both are coincident but, for instance, the 
particle may be partially buried beneath the substrate interface and h < R. 
This is not uncommon experimentally when a sample is coated with a metallic 
layer. For such system, the size of the scatterer is not severely changed, but 
the phase shifts of contributions 2, 3 and 4 are decreased slightly [13]. The 
opposite case, h > R, occurs when a particle is lifted above a fiat substrate. 
This is not uncommon for cylinders at rest on a fiat substrate [14]. 
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2.2 Model Accuracy 

For many applications, approximate results are all that is necessary. The 
exact shapes and heights of the scattering intensity maxima and minima 
may not always be important. Many applications depend on the changes in 
the positions of the maxima and minima with system parameters. We demon- 
strate the accuracy of the double-interaction model in Fig. 2 which shows the 
scattering intensities as a function of scattering angle 9g. For comparison, 
double-interaction results are shown with and without shadowing. Exact re- 
sults are calculated using a method of images [14]. We see that the predictions 
achieved using the simple model agree quite well with the more complicated 
model. The inclusion of the shadowing effect improves the accuracy of the 
model results. 




0^ ( Deg ) 



Fig. 2. Numerical s-polarized scattering patterns corresponding to an i? = 0.55 /rm 
cylinder on a substrate (both perfectly conducting) for 9i = 30° . Results are shown 
for the double-interaction model (solid), double-interaction model with shadowing 
effect (dotted), and exact results calculated using the method of [14] (dashed). 



3 Application of the Model: A Particle Sizing 
Technique 

The model presented above has two important characteristics: it is very fast, 
and it is reliable in reproducing the main properties of the scattering pat- 
terns. One of these properties, the position of the intensity minima, shows 
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clear regularities in its behavior when the size of the particle or the angle of 
incidence change. An extensive analysis has to be carried out in order to ex- 
tract an analytic expression to characterize this behavior. Figure 3 shows the 
full-scan scattering pattern calculated by the model for S polarization within 
the plane of incidence for an i? = 0.55 /im cylinder whose axis is perpen- 
dicular to that plane of incidence. The figure also shows the corresponding 
experimental results for such situation [11]. The positions of the intensity 
minima for the scattered light are approximately the same for the cylinder 
and sphere, although the scattering patterns are very different. 




0^ ( Deg ) 



Fig. 3. Scattering pattern of an A = 0.55 fim cylinder on a snbstrate (both gold 
coated), 9i = 30°. Circles show experimental results and the continuous line shows 
double-interaction-model results including the shadowing effect. 



3.1 Results: General Expression for Oblique Incidence 

A particular scattering minimum can be represented by an index m. The 
order m = 0 corresponds to the minimum farthest from the specular peak 
(nearest grazing incidence). The order m increases towards the specular peak. 
Depending on the incident angle, there are two minima of each order on 
either side of the specular peak. To distinguish between these two sets, we 
refer to minima on the side of the specular peak including the back-scatter 
direction as the backward minima and those occuring on the other side of 
the specular peak as forward-scatter minima. Consequently, we define two 
scattering regions: the forward region, corresponding to Og > 0i and the 
backward region, corresponding to Os < di- A great number of patterns similar 
to those of Fig. 3 have been calculated using the double-interaction model for 
different size parameters to study the behavior of the minima. Fig. 4 shows 
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one example of the minima for cylinders illuminated at 9i = 20° plotted as a 
function of the inverse of the relative diameter D,. = 2R / A. 





( DA ) 



Fig. 4. Theoretical angular positions of minima for Oi — 20° as a function of the 
relative diameter for cylinders on a substrate. The forward region {9s > 9i) is shown 
on the top and the backward region {9s < 9i) is shown on the bottom. 



The behavior of the minima is approximately linear with (A) Plots 
for spheres are almost identical to those obtained for cylinders, meaning that, 
though the scattering patterns are different, the behavior of the minima are 
very similar. A general expression may be found relating the relative size to 
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the position of a minimum of order m. Forward and backward minima must 
be considered separately. The forward minima {9 > 9i) can be described by 



e . - 90 - ^ 

jj 



(15) 



In the above equation, the position is inversely proportional to Dr, where 



af{9i,m) = a{9i) + b{9i)m (16) 

a{9i) = 33.47 (±0.08) - 0.097 (±0.010) 9, (17) 

h{9i) = 36.40 (±0.05) - 0.149 (±0.006) 9, (18) 

Similarly, the backward minima {9 < 9i) can be described by 

^min = “90 ± -jy (19) 

where 

ab{9^, m) = a'{9i) ± b'{9^)m (20) 

a' (9,) = 33.55 (±0.08) ± 0.153 (±0.010) 9, (21) 

b'{9,) = 36.22 (±0.06) ± 0.170 (±0.007) 9, (22) 



Obviously, 9mm > 9i in the first case, and 9mm < 9i in the second. Both 9mm 
and 9i are expressed in degrees. These expressions generalize that of [10] in 
which normal-incident illumination is considered. An evaluation of Dr (and 



Table 1. Values of D (5th column) obtained from the experimental patterns, corre- 
sponding to particles of approximate nominal size Dth ■ N is the number of minima 
used in the fitting process. AD is the standard deviation, expressed in percent error 
in the last column. 



Type 


Ah(fim) 


6»i (Deg) 


N 


D(fim) 


AD 


% 


Cylinder 


1.1 


0 


4 


1.11 


0.07 


(6.3) 






10 


4 


1.07 


0.07 


(6.5) 






20 


4 


1.10 


0.08 


(7.3) 






30 


3 


1.08 


0.09 


(8.3) 


Sphere 


1.1 


8 


4 


1.12 


0.08 


(7.1) 






30 


3 


1.13 


0.09 


(8.0) 


Sphere 


3.2 


8 


16 


3.13 


0.27 


(8.6) 






30 


10 


3.19 


0.38 


(12) 
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D = DrX) has been made by means of (15) and (19) for several samples. The 
angular positions of the minima have been obtained experimentally from the 
full-scan scattering patterns similar to those shown in Fig. 3, with an angular 
resolution of 1°. Table 1 shows the values obtained from measurements at 
different angles of incidence. 

This particle-sizing method is simple, but often not accurate for many 
applications (see, for instance, [17]). However, the possibilities of the method 
have not been exploited yet, as we will see in the next subsection, where 
another procedure is described. 



3.2 Tracking the Minima 

The positions of the minima predicted by the model are almost coincident to 
those predicted by an exact solution (see Fig. 2) from normal incidence out 
to approximately 9i = 30° . This allows us to track the position of a minimum 
as the angle of incidence changes, and then compare the experimental evolu- 
tion with that predicted by the model. The process is quite simple, avoiding 
the measurement of the full scattering patterns, and increasing the amount 
of information concerning that minimum. Once the scattering minima have 
been located for a given angle of incidence (usually normal incidence) , one of 
these minima is selected, and the detecting system continuously scans a small 
interval around it. As the angle of incidence changes, the system tracks the 
minimum, obtaining a curve 0min (w) versus 9i. This process can be carried 
out for any minima having measurable visibility. 



Table 2. Values of D obtained from the experimental patterns, corresponding 
to particles of approximate nominal size Dth. m corresponds to the order of the 
minimum which position has been fitted, either to the predictions of the model 
or to a linear behavior which cut at 9i = 0° also serves the purpose of giving an 
estimate of D. The letter after the order stands for the forward (f) or the backward 
(b) minimum of order m. The interval in which the fit has been carried out is also 
specified. 



Type 


Dth 


m 


Interval 
[Deg, Deg] 


D(/im) 


AD(%) 


Model fit 


1.1 


If 


[0,16] 


1.079 


0.005 (0.5) 


Model fit 


1.1 


lb 


[0,20] 


1.077 


0.004 (0.4) 


Linear 9i = 0° 


1.1 


If 


[0,12] 


1.072 


0.005 (0.5) 


Linear 9i = 0° 


1.1 


lb 


[0,12] 


1.071 


0.004 (0.4) 



Two of these curves are shown in Fig. 5 for the case of a cylinder of 
approximate size D = l.\ fxm. The corresponding fit values are shown in the 
first part of Table 2. The use of the model may be minimized combining this 
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0, (Deg) 

Fig. 5. Experimental (squares) and model (dots) predictions of the minima positi- 
ons of order m — 1 as a. function of the incident angle for a D = 1.1 /im cylinder. 
The forward region is shown on the top and backward region on the bottom. 



procedure with the use of (15) and (19). These curves {0min/^i} have always 
a linear behavior up to about « 10°. We may obtain an accurate value of 
^^min for normal incidence by means of a linear fit. The size will be obtained 
by direct substitution in (15) and (19) - the result of this procedure is shown 
in the second part of Table 2. 
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3.3 Polydisperse Samples 

Polydisperse samples contain particles having different properties. Most com- 
monly, we consider a sample of particles having polydispersity in size, for in- 
stance, a distribution of spheres of identical material whose radii vary. Such 
samples are common in nature and in manufactured processes. We discuss 
methods of analysis of the scattering intensities from samples displaying high 
and low size polydispersities. These two different polydispersities require dif- 
ferent analyses. We characterize the polydispersity by a gamma-type distri- 
bution because of its versatility 



p{R, /i, a)dR 



Rfj, exp (—aR) 



dR 



(23) 



with 



Rr. 



d- = 






(24) 



and i?o is the mean radius of the particle and cr^ the size variance. We also 
define the degree of polydispersity as 



_ 1 

r = n 2 



a 




(25) 



Low Polydispersity Distributions with r < 0.1 are considered here as 
having low polydispersity. These include, for example, samples of well cha- 
racterized particles supplied by manufacturers. Another polydisperse sample 
may include a fiber whose radius varies along its length. In Fig. 6 the model 
calculations of a scattering pattern corresponding to a monodisperse sam- 
ple of spherical particles is compared with those of samples having different 
degrees of polydispersity. The immediate and obvious effect that a size dis- 
tribution has on the scattering patterns is the loss of visibility in the pattern. 
This is due to the fact that different size microstructures have their minima 
at different angular positions. The incoherent addition of these scattering 
patterns causes a loss of visibility in the total measured scattering pattern. 
For high size polydispersity, the scattering patterns eventually lose all lobed 
structure. 

In Fig. 6, the lobed pattern is still visible because of the relatively low 
degree of polydispersity. One very interesting feature is that the angular posi- 
tions of the minima are approximately the same as those of the corresponding 
monodisperse system whose radius is the mean of the polydisperse sample. It 
can also be observed that the minima of higher order lose their visibility for 
lower polydispersities, while the lower orders still remain visible for not so 
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Fig. 6. Light scattering intensities from polydisperse spherical particles on a sub- 
strate illuminated at normal incidence as predicted by the double-interaction model 
including shadowing effects. The mean relative value is Ro,r = Ro/^ = 2.53 and 
r = 0% (dots), r = 3% (solid) and r = 6% (dashed) are shown. 



low polydispersities. In Fig. 7 the evolution of the visibility as a function of 
the degree of polydispersity is plotted for several orders. In principle, a know- 
ledge of the visibility for each order minima allows an estimate of the degree 
of polydispersity to be made. This, added to the fact that it is still possible 
to determine the central monodisperse value by means of the positions of the 
minima, leads to a characterization of the sample. In particular. Fig. 7 also 
shows the experimental data obtained from a cylinder of 1.1 /rm diameter 
showing a nominal polidispersity of about 4% and illuminated along 2.5 mm. 



High Polydispersity and Enhanced Back-Scattering For high poly- 
dispersity r > 0.25 the lobes in the scattering pattern become completely 
smoothed and can no longer be used to characterize the scattering system. 
To quantify the scattering system, we must resort to other methods. One use- 
ful method tool is the enhanced-backscatter peak. This is a coherent effect 
whose origin can be found in the superposition of the components described 
in Sect. 2. In the back-scattering direction, contributions of the once-reflected 
beams (components 2 and 3) always add constructively. This generally results 
in a peak in the back-scatter direction. High polydispersity helps to smooth 
out any remaining structure, so that the enhanced-backscatter peak is the 
only significant structure remaining. The width of this peak is inversely pro- 
portional to the size of any characteristic surface structures, and can be used 
to partially characterize the sample. The peak width is not only dependent on 
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Fig. 7. Evolution of the visibility V(m) for some values of r (expressed in %). 
Circles correspond to the measurable points (m=integer number). Dots lines with 
circles: Results predicted from the model. Dots lines with triangles: Experimental 
results for a Cylinder of approximate diameter D = 1.1 /rm showing a polydispersity 
of about 4%. 
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Fig. 8. Shape of the enhanced-back-scatter peak predicted by the model, for a sam- 
ple of gold spherical particles of relative mean radius Ro,r = Ro/X = 1 illuminated 
at 9i = 70°. Polydispersities of r = 0.5 (continuous line), r = 1 (dots line), and 
r = 2 (dashed line) are shown. 
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Fig. 9. Evolution of the back-scattering peak width as a function of polydispersity 
r. The top graph shows samples of Ro,r ~ 0.5 (dots), Ro,r ~ 1 (squares), and 
Ro,r = 2 (crosses) illuminated at 6i = 70°. The bottom graph shows samples of 
Ro,r = 1 illuminated at 9i = 40° (dots), 6i = 60° (squares), and 9i = 80° (crosses). 
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the polydispersity, but also on the mean size and angle of incidence. Figure 
8 shows the different shapes of the backscattering peak for different values of 
the degree of polydispersity r for = 70° and Ro/X= 1. The width increases 
and the symmetry of the peak decreases as the value of r grows. In Fig. 9 
the width is represented as a function of r and 9i. 

The fact that several combinations of r and Ro r produce the same peak 
width introduces an uncertainty in the determination of the mean size from 
the measurement of the width. In particular, a group of distributions with 
a constant product rRo^r produces approximately the same peak width. By 
itself, the experimental detection of a peak only provides information about 
which member of the family ri?o,r the microstructures correspond. 

4 Conclusions 

The light scattered from a particle on a flat substrate may be approximately 
calculated from the knowledge of the scattering from the isolated particle. 
The double-interaction model is useful because of its simplicity, transparency, 
and speed at which it can be used to calculate and estimate properties of the 
particle system. Using this model, simple relationships can be established 
between scattering minima positions and contaminant size. The interference 
pattern has its origin in the particle uniformity, and a size polydispersity 
produces a progressive loss of visibility in the interference, which eventually 
completely destroys the lobed pattern. For the case of high polydispersity, 
the constructive interference of two of the components in the back-scatter 
direction allows us to make a quantitative measure of the scattering system. 
Although these principles have been applied to spherical and cylindrical me- 
tallic particles, the double-interaction method may be applied to other scat- 
tering systems. The characteristic evolution of the back-scattering patterns, 
the study of the out-of-plane scattered light, and the comparison between size 
and shape polydispersity for irregular particles are some of the applications 
for which this simple method may be applied. 
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